Abstract. The operator defined by convolution, in $\mathbb{R}^d$, with (affine) surface measure
on a paraboloid satisfies a dilation-invariant $L^p \to L^q$ inequality, and enjoys
a high-dimensional group of symmetries. Extremal functions are shown to exist
for this inequality. Moreover, any extremizing sequence is shown to be precompact
modulo this symmetry group.



1. Introduction 

If $X, Y$ are Banach spaces, and if $T : X \to Y$ is a bounded linear operator, then by
an extremal vector for the associated inequality $\|Tx\|_Y \le C\|x\|_X$ is meant a nonzero
vector $x \in X$ such that $\|Tx\|_Y = \|T\|\,\|x\|_X$. The following are natural questions:

(1) What is the norm of $T$?

(2) Does there exist at least one extremal vector $x$?

(3) Are extremal vectors unique, modulo scalar multiplication, or modulo ex- 
plicitly known symmetries enjoyed by T? If not, then what is the set of all 
extremal vectors? 

(4) If $x$ is a quasiextremal vector in the sense that $\|Tx\|_Y \ge c\,\|T\|\,\|x\|_X$ for a
specified scalar $c < 1$, then what can be said quantitatively about the structure
of $x$? There are three natural regimes for this question: (a) for fixed $c$, (b) as
$c \to 0$, and (c) as $c \to 1$.

(5) What quantitative properties do extremal vectors enjoy? 

(6) One can likewise define extremal and quasiextremal pairs $(x,y) \in X \times Y^*$,
where $Y^*$ is the dual of $Y$; an extremal pair is one which satisfies
$|y(Tx)| = \|T\|\,\|y\|_{Y^*}\,\|x\|_X$. Each of questions (2) through (5) has an analogue for such
pairs.

This paper investigates these questions, for a specific operator which arises in 
Euclidean harmonic analysis. This operator is modestly fundamental, as a basic 
example of a Fourier integral operator, of a Radon-like transform, and of an object 
whose analytic properties are governed by geometry and combinatorics. Within those 
frameworks it occupies a unique niche, attested to by an associated high-dimensional 
symmetry group, connected to the Heisenberg group and metaplectic representation. 

Regard $\mathbb{R}^d$ as $\mathbb{R}^{d-1} \times \mathbb{R}$ with coordinates $x = (x', x_d)$. Let $\sigma$ be the singular
measure on $\mathbb{R}^d$, supported on the paraboloid $\{x : x_d = |x'|^2\}$, defined by
$\int f\,d\sigma = \int_{\mathbb{R}^{d-1}} f(x', |x'|^2)\,dx'$. The measure $\sigma$ and surface measure on the paraboloid $x_d = |x'|^2$ are
mutually absolutely continuous, but $\sigma$ enjoys a certain dilation symmetry which surface
measure lacks. See [?] for discussion of a general context for such a measure.
The convolution operator and inequality under discussion are: 

(1.1)  $Tf = f * \sigma$

(1.2)  $\|Tf\|_{L^{d+1}(\mathbb{R}^d)} \le A\,\|f\|_{L^{(d+1)/d}(\mathbb{R}^d)}$

for all $f \in C^0 \cap L^\infty \cap L^1(\mathbb{R}^d)$. Here $A < \infty$ denotes the optimal constant in the
inequality, which depends only on the dimension $d$. For some discussion of the
fundamental nature of this inequality, see [3].
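The dilation invariance of (1.2) can be checked by a short scaling computation. The sketch below uses the parabolic dilations $f_r(x) = f(rx', r^2x_d)$ for $r > 0$; the notation $f_r$ is introduced here only for illustration, and $p = (d+1)/d$, $q = d+1$.

```latex
\begin{align*}
Tf_r(x) &= \int_{\mathbb{R}^{d-1}} f\bigl(r(x'-t),\, r^2(x_d-|t|^2)\bigr)\,dt
         = r^{-(d-1)}\,(Tf)(rx',\, r^2x_d)
         \qquad (\text{substitute } s = rt), \\
\|Tf_r\|_{q} &= r^{-(d-1)-(d+1)/q}\,\|Tf\|_{q}, \qquad
\|f_r\|_{p} = r^{-(d+1)/p}\,\|f\|_{p}.
\end{align*}
% The ratio \|Tf_r\|_q / \|f_r\|_p is independent of r exactly when
%   (d-1) + (d+1)/q = (d+1)/p,
% and for p = (d+1)/d, q = d+1 both sides equal d.
```

Scaling alone only restricts $(p,q)$ to a one-parameter family; it is consistent with, but does not by itself single out, the exponents in (1.2).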

In this paper we establish the existence of extremals, and of extremal pairs, for 
T in all dimensions. Moreover, we obtain uniform quantitative information about 
the behavior of extremals, and of $(1-\delta)$-quasiextremals with $\delta$ close to 0. We show
that sequences of approximate extremizers are precompact, after renormalization 
by an explicitly described symmetry group. This complements our earlier work [3] 
concerning c-quasiextremals, in the regime where c is not close to 1. Various natural 
questions remain open: We neither identify these extremals and the optimal constant 
$A$, nor address the question of their uniqueness modulo symmetries.

One of our objectives is to begin to develop techniques which should be useful in an 
analysis of extremals for other closely related operators. See [3] for one such problem. 
This paper is the second in a series treating aspects of the meta-question: If the ratio
$\Phi(f) = \|Tf\|_q/\|f\|_p$ is large, then what are the properties of $f$? The word "large"
admits various interpretations. The initial work [3] is a study of those functions $f$
for which $\Phi(f)$ is bounded below by some positive constant. In [6], qualitative
properties of arbitrary critical points of $\Phi$ are studied. The paper [1] demonstrates
an equivalence between the inequality studied here, and a certain inequality for the
Radon transform, and explicitly identifies all extremizers for both. This equivalence,
together with a rearrangement inequality proved in [?] and special considerations for
the subclass of all radially symmetric functions, could be used to give an alternative
proof of the existence of extremizers for $\Phi$. However, the arguments given in the
present paper, which ultimately rely on qualitative rather than exact symmetries, 
are more general and therefore retain some interest. One problem which has such 
qualitative but not exact symmetries is studied in [5]. 

We are indebted to Rene Quilodran and to Shuanglin Shao for useful advice on 
the exposition, and to Terence Tao and Shuanglin Shao for posing related questions 
to us. 

2. Results 

Define 

$A = \sup_{\|f\|_{(d+1)/d} = 1} \|Tf\|_{d+1}$.

Definition 2.1. An extremizer for the inequality (1.2) is a function $f \in L^{(d+1)/d}(\mathbb{R}^d)$
which satisfies $\|Tf\|_{L^{d+1}} = A\,\|f\|_{L^{(d+1)/d}} \ne 0$.
An extremizing sequence for the inequality (1.2) is a sequence of nonnegative
functions $f_\nu \in L^{(d+1)/d}$ satisfying

(2.1)  $\|f_\nu\|_{L^{(d+1)/d}} = 1$

(2.2)  $\|Tf_\nu\|_{L^{d+1}} \to A$.

For any $\delta \in [0, 1)$, $f \in L^{(d+1)/d}$ is a $(1-\delta)$-quasiextremal for (1.2) if

(2.3)  $\|Tf\|_{L^{d+1}} \ge (1-\delta)\,A\,\|f\|_{L^{(d+1)/d}} \ne 0$.

In Definition 3.1 below we will introduce a group $\mathcal{G}_d$ of diffeomorphisms of $\mathbb{R}^d$
which are natural symmetries of our problem. Associated to each $\phi \in \mathcal{G}_d$ is an
invertible linear operator $\phi^*$ on $L^{(d+1)/d}(\mathbb{R}^d)$ satisfying $\|\phi^*f\|_{(d+1)/d} = \|f\|_{(d+1)/d}$ and
$\|T(\phi^*f)\|_{d+1} = \|T(f)\|_{d+1}$ for all $f \in L^{(d+1)/d}$. Thus for any sequence of functions
$f_\nu$ and elements $\phi_\nu \in \mathcal{G}_d$, $(f_\nu)$ is an extremizing sequence if and only if $(\phi_\nu^*f_\nu)$ is an
extremizing sequence; extremizing sequences can only be characterized modulo the
action of $\mathcal{G}_d$.

Theorem 2.2. (i) There exist extremizers for the inequality (1.2).
(ii) Let $\{f_\nu\}$ be any extremizing sequence for the inequality (1.2). Then there exist an
extremal $f$ for (1.2) satisfying $\|f\|_{(d+1)/d} = 1$, a subsequence $\{f_{\nu_i}\}$, and a sequence
of symmetries $\phi_i \in \mathcal{G}_d$ such that $\phi_i^*(f_{\nu_i}) \to f$ in $L^{(d+1)/d}$ norm.

(iii) There exist a constant $C_0 < \infty$, a function $\Psi : [0,\infty) \to [0,\infty)$ satisfying
$\Psi(t) \ge t^{(d+1)/d}$ for all $t$ and $\Psi(t)/t^{(d+1)/d} \to \infty$ as $t \to 0^+$ and also as $t \to \infty$, and
a function $\rho : [1,\infty) \to (0,\infty)$ satisfying $\rho(R) \to 0$ as $R \to \infty$, with the following
property. For any extremizing sequence $\{f_\nu\}$ for the inequality (1.2), there exists
a sequence of elements $\phi_\nu \in \mathcal{G}_d$ such that for all sufficiently large $\nu$, $\phi_\nu^* f_\nu$ can be
decomposed as $g_\nu + h_\nu$ so that $\|h_\nu\|_{(d+1)/d} \to 0$,

(2.4)  $\int \Psi(g_\nu) \le C_0$;

for all $R \ge 1$,

(2.5)  $\int_{|x| \ge R} g_\nu(x)^{(d+1)/d}\,dx \le \rho(R)$.

Moreover for any $R \ge 1$, for all $\nu \ge \nu(R)$, $g_\nu$ may be further decomposed as $g_\nu^\sharp + g_\nu^\flat$
where both summands continue to satisfy (2.4) and (2.5), and moreover

(2.6)  $\|g_\nu^\sharp\|_{C^1} \le R$ and $\|g_\nu^\flat\|_{L^{(d+1)/d}} \le \eta(R)$

where $\eta(R) \to 0$ as $R \to \infty$, and the function $\eta$ is independent of the extremizing
sequence.

(iv) For any nonnegative extremizer $f$ for (1.2) satisfying $\|f\|_{(d+1)/d} = 1$, there exists
$\phi \in \mathcal{G}_d$ such that $\phi^*f$ satisfies (2.4), (2.5), and (2.6).

(v) Any complex-valued extremizer for (1.2) agrees almost everywhere with $e^{i\theta} f$ for
some $C^\infty$ nonnegative function $f$ and some constant $\theta \in \mathbb{R}$.
(vi) Any nonnegative extremizer $f$ for inequality (1.2) with $\|f\|_{(d+1)/d} = 1$ satisfies the
Euler-Lagrange equation

(2.7)  $T^*([Tf]^d) = A^{d+1} f^{1/d}$

almost everywhere. For any compact set $K \subset \mathbb{R}^d$ there exists $c > 0$ such that $f(x) \ge c$
for almost every $x \in K$.

In conclusion (vi), $T^*$ denotes the transpose of $T$.
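The form of (2.7), including the constant $A^{d+1}$, can be motivated by a formal Lagrange multiplier computation. This is a heuristic sketch, not the argument of the paper: it assumes that differentiating the norms along $f + th$ is justified, and the multiplier $\mu$ is a symbol introduced here for illustration.

```latex
% Maximize \|Tf\|_q^q subject to \|f\|_p^p = 1, with p = (d+1)/d, q = d+1, f \ge 0.
\begin{align*}
\frac{d}{dt}\Big|_{t=0}\|T(f+th)\|_q^q
   &= q\,\langle (Tf)^{q-1},\, Th\rangle
    = q\,\langle T^*\bigl((Tf)^{d}\bigr),\, h\rangle, \\
\frac{d}{dt}\Big|_{t=0}\|f+th\|_p^p
   &= p\,\langle f^{p-1},\, h\rangle
    = p\,\langle f^{1/d},\, h\rangle .
\end{align*}
% Stationarity for every h gives T^*((Tf)^d) = \mu f^{1/d}. Pairing with f:
\[
\|Tf\|_q^q = \langle T^*((Tf)^d),\, f\rangle = \mu\,\|f\|_p^p = \mu,
\qquad\text{so}\qquad \mu = A^{q} = A^{d+1}.
\]
```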

Remark 2.3. Extremizers of (1.2) do not belong to the Schwartz class. Indeed, if $f$
is a continuous nonnegative solution of the Euler-Lagrange equation (2.7) which does
not vanish identically, then $\limsup_{|x|\to\infty} |x|^{d/2} f(x) > 0$. For $Tf$ is bounded away from
zero in some ball $B$. The equation and nonnegativity together force $f^{1/d} \ge c\,T^*(\chi_B)$,
and consequently $f(x', x_d) \ge c(1+|x'|)^{-d}$ for all $(x', x_d)$ belonging to a certain tubular
neighborhood, of constant width, of a paraboloid. In such a region, $|x'| \sim |x|^{1/2}$ as
$|x| \to \infty$.

Quasiextremals for the inequality (1.2) were studied in [3]. These are by definition
functions which satisfy $\|Tf\|_{d+1} \ge \varepsilon\,\|f\|_{(d+1)/d}$ for an arbitrary constant $\varepsilon > 0$; $\varepsilon$ need
not be close to the optimal constant $A$. The analytic techniques introduced there
were further developed in [10] to treat quasiextremals for the corresponding inequality
for $T_{\mathrm{sphere}}$. These techniques form the basis of the present paper. However, we
have structured the exposition to emphasize certain geometric facets of the subject,
especially the symmetry group and family of paraballs introduced in §3, which were
less fully developed in [3].

Throughout the paper we assume that the dimension of the ambient space $\mathbb{R}^d$
satisfies $d \ge 2$. Define

$p = \frac{d+1}{d}$ and $q = d + 1$.

Since $|T(f)| \le T(|f|)$ and $\|\,|f|\,\|_p = \|f\|_p$, there exist extremal functions for the
inequality in question if and only if there exist nonnegative extremal functions; we
assume henceforth, without loss of generality and without further comment, that
all functions under discussion are nonnegative. $c, C, \gamma$ will denote finite positive
constants which depend only on the dimension $d$, and which are permitted to change
values from one occurrence to the next. Typically $c$ will be small, while $C$ will be
large.

3. Preliminaries 
We begin by collecting various facts which will be used in the analysis. 

3.1. Symmetries. The operator $T$ enjoys a relatively high-dimensional Lie group
of symmetries. Let $Q : \mathbb{R}^{d+d} \to \mathbb{R}$ be the function

$Q(x, y) = x_d - y_d - |x' - y'|^2$

and let $\mathcal{I}$ be the incidence manifold $\mathcal{I} = \{(x,y) \in \mathbb{R}^{d+d} : Q(x,y) = 0\}$. $\mathrm{Diff}(\mathbb{R}^n)$
denotes the group of all $C^\infty$ diffeomorphisms of $\mathbb{R}^n$.
Definition 3.1. $\mathcal{G}_{d,d}$ denotes the set of all ordered pairs $(\phi, \psi) \in \mathrm{Diff}(\mathbb{R}^d) \times \mathrm{Diff}(\mathbb{R}^d)$
which preserve $\mathcal{I}$ in the strong sense that there exists $0 \ne \lambda \in \mathbb{R}$ such that

(3.1)  $Q(\phi(x), \psi(y)) = \lambda\,Q(x, y)$ for all $(x, y) \in \mathbb{R}^{d+d}$.

$\mathcal{G}_d$ denotes the set of all $\phi \in \mathrm{Diff}(\mathbb{R}^d)$ for which there exists $\psi$ such that
$(\phi, \psi) \in \mathcal{G}_{d,d}$.

In particular, $(x,y) \in \mathcal{I} \Rightarrow (\phi(x), \psi(y)) \in \mathcal{I}$.
The following are examples of elements of $\mathcal{G}_{d,d}$.

• $(\phi(x), \psi(y)) = (x + v,\; y + v)$, for any $v \in \mathbb{R}^d$

• $(\phi(x), \psi(y)) = (rx', r^2x_d;\; ry', r^2y_d)$, for any $r \in \mathbb{R} \setminus \{0\}$

• $(\phi(x), \psi(y)) = (x' + u,\; x_d + 2u \cdot x' + |u|^2;\; y',\; y_d + 2u \cdot y')$, for any $u \in \mathbb{R}^{d-1}$

• $(\phi(x), \psi(y)) = (L(x'),\; x_d + |Lx'|^2 - |x'|^2;\; L^{-T}(y'),\; y_d - |L^{-T}(y')|^2 + |y'|^2)$, for any
invertible linear transformation $L : \mathbb{R}^{d-1} \to \mathbb{R}^{d-1}$, where $L^{-T}$ is the inverse of
the transpose of $L$.
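The four examples above can be checked numerically against the defining relation (3.1). The following is a minimal sketch, assuming $d = 3$ so that $x' \in \mathbb{R}^2$ and $L$ is a $2 \times 2$ matrix; all function names (`incidence`, `translation`, `dilation`, `shear`, `linear`, `max_defect`) are hypothetical and chosen only for this illustration. Each map returns the pair $(\phi(x), \psi(y))$, and `max_defect` reports the largest observed value of $|Q(\phi(x),\psi(y)) - \lambda Q(x,y)|$ over random points.

```python
import random


def dot(a, b):
    return sum(s * t for s, t in zip(a, b))


def incidence(x, y):
    # Q(x, y) = x_d - y_d - |x' - y'|^2; the last list entry is the d-th coordinate.
    diff = [s - t for s, t in zip(x[:-1], y[:-1])]
    return x[-1] - y[-1] - dot(diff, diff)


def translation(x, y, v):
    return [s + t for s, t in zip(x, v)], [s + t for s, t in zip(y, v)]


def dilation(x, y, r):
    def scale(z):
        return [r * s for s in z[:-1]] + [r * r * z[-1]]
    return scale(x), scale(y)


def shear(x, y, u):
    xp, yp = x[:-1], y[:-1]
    phi = [s + t for s, t in zip(xp, u)] + [x[-1] + 2 * dot(u, xp) + dot(u, u)]
    psi = list(yp) + [y[-1] + 2 * dot(u, yp)]
    return phi, psi


def linear(x, y, L):
    # L is an invertible 2x2 matrix; L_inv_t is the inverse of its transpose.
    (a, b), (c, e) = L
    det = a * e - b * c
    L_inv_t = [[e / det, -c / det], [-b / det, a / det]]
    xp, yp = x[:-1], y[:-1]
    Lx = [dot(row, xp) for row in L]
    Ly = [dot(row, yp) for row in L_inv_t]
    phi = Lx + [x[-1] + dot(Lx, Lx) - dot(xp, xp)]
    psi = Ly + [y[-1] - dot(Ly, Ly) + dot(yp, yp)]
    return phi, psi


def max_defect(transform, lam, trials=200, seed=0):
    # Largest observed |Q(phi(x), psi(y)) - lam * Q(x, y)| over random points of R^3.
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        x = [rng.uniform(-1, 1) for _ in range(3)]
        y = [rng.uniform(-1, 1) for _ in range(3)]
        phi, psi = transform(x, y)
        worst = max(worst, abs(incidence(phi, psi) - lam * incidence(x, y)))
    return worst
```

For instance, `max_defect(lambda x, y: dilation(x, y, 1.7), 1.7 ** 2)` should be at the level of rounding error, reflecting $\lambda = r^2$ for the dilations, while the other three families have $\lambda = 1$.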

Denote by $\tau$ the involution $x \mapsto -x$ of $\mathbb{R}^d$.

Lemma 3.2. (i) The correspondence $(\phi, \psi) \mapsto \phi$ is a bijection of $\mathcal{G}_{d,d}$ with $\mathcal{G}_d$.
(ii) If $(\phi, \psi) \in \mathcal{G}_{d,d}$ then $\tau \circ \psi \circ \tau \in \mathcal{G}_d$.

(iii) Every $\phi \in \mathcal{G}_d$ is of the form $(x', x_d) \mapsto (Lx',\, Ax_d + Q(x'))$ for some invertible
affine endomorphism $L$ of $\mathbb{R}^{d-1}$, some invertible affine endomorphism $A$ of $\mathbb{R}^1$, and
some polynomial $Q : \mathbb{R}^{d-1} \to \mathbb{R}$ of degree $\le 2$. The pair $(L, A)$ is unrestricted. $Q$
takes the form $Q = q + a$ where $q$ is a homogeneous polynomial of degree 2 uniquely
determined by $(L, A)$, while $a$ is an arbitrary affine mapping.

Proof. (ii) is a direct consequence of the definitions.

To establish (iii), write $\phi(x) = (F(x), f(x)) \in \mathbb{R}^{d-1} \times \mathbb{R}$ and similarly $\psi(y) =
(G(y), g(y))$. To the equation

$f(x) - g(y) - |F(x) - G(y)|^2 = \lambda\,(x_d - y_d - |x' - y'|^2)$

apply any mixed second partial derivative $\partial^2/\partial x_i\,\partial y_j$ to conclude that

$\Big\langle \frac{\partial F(x)}{\partial x_i},\, \frac{\partial G(y)}{\partial y_j} \Big\rangle$ is independent of $x, y$.

Since $\mathbb{R}^d \ni x \mapsto F(x) \in \mathbb{R}^{d-1}$ is a submersion at every $x$, it follows that $\nabla G(y)$ is
independent of $y$; similarly $\nabla F(x)$ is independent of $x$. Thus $F, G$ are affine functions.

If $i = d$ or $j = d$ then $\partial^2/\partial x_i\,\partial y_j$ annihilates $x_d - y_d - |x' - y'|^2$, so it must annihilate
$F(x) \cdot G(y)$. It follows again from the submersion property that $F, G$ are independent
of $x_d, y_d$.

By comparing terms we see that $F(x) \cdot G(y) = \lambda\,x' \cdot y'$ plus an affine function of
$x', y'$. Therefore $F, G$ take the form $F(x) = Ax' + u$ and $G(y) = \lambda By' + v$, where $B$
is the transpose of $A^{-1}$ and $u, v$ are vectors in $\mathbb{R}^{d-1}$.

Now $f(x) - g(y)$ equals $\lambda x_d - \lambda y_d$ plus a quadratic polynomial in $x', y'$. This forces
$f$ to be a quadratic polynomial in $(x', x_d)$ in which the coefficient of $x_d x_j$ vanishes for
all $1 \le j \le d$. Likewise for $g$. The other parts of the description of $\phi, \psi$ now follow
from the equation.
In order to prove that $(\phi, \psi) \mapsto \phi$ is a bijection, which was conclusion (i), it suffices
to prove that if $\phi$ is the identity, then so must be $\psi$. We already know that $\psi$ must
take the form $\psi(y', y_d) = (\lambda y' + v,\, g(y))$. Thus

$x_d - g(y) - |x' - \lambda y' - v|^2 = \lambda\,(x_d - y_d - |x' - y'|^2)$.

Equating coefficients of $x_d$ forces $\lambda = 1$. Therefore $y_d = g(y) - 2v \cdot (x' - y') + |v|^2$.
Since the left-hand side is independent of $x'$, $v$ must vanish, leaving $g(y) = y_d$. □

Some further information concerning $\mathcal{G}_d$ may be found in §10 below.

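For each $\phi \in \mathcal{G}_d$, the Jacobian determinant $J_\phi : \mathbb{R}^d \to (0, \infty)$ of $\phi$ is a constant
function. Associated to each element $\phi \in \mathcal{G}_d$ is the mapping $\phi^* : L^{(d+1)/d}(\mathbb{R}^d) \to
L^{(d+1)/d}(\mathbb{R}^d)$ defined by $\phi^*f(x) = f(\phi(x))\,J_\phi^{d/(d+1)}$. Then

(3.2)  $\|\phi^*f\|_{(d+1)/d} = \|f\|_{(d+1)/d}$,
       $\langle \psi^*g,\, T(\phi^*f) \rangle = \langle g,\, T(f) \rangle$

for all $f, g \in L^{(d+1)/d}$.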
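The norm identity in (3.2) is a change of variables. The sketch below assumes the normalization $\phi^*f(x) = f(\phi(x))\,J_\phi^{d/(d+1)}$ with $J_\phi$ the constant, positive Jacobian determinant of $\phi$, and writes $p = (d+1)/d$.

```latex
\[
\|\phi^* f\|_{p}^{p}
 = \int_{\mathbb{R}^d} |f(\phi(x))|^{p}\, J_\phi^{\frac{d}{d+1}\cdot\frac{d+1}{d}}\,dx
 = \int_{\mathbb{R}^d} |f(\phi(x))|^{p}\, J_\phi\,dx
 = \int_{\mathbb{R}^d} |f(y)|^{p}\,dy
 = \|f\|_{p}^{p}.
\]
% The exponent d/(d+1) on J_\phi is exactly 1/p, which is what makes the
% Jacobian factor from the substitution y = \phi(x) cancel.
```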
3.2. Paraballs. 

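Definition 3.3. Let $r_1, \dots, r_{d-1}, \rho > 0$, let $e = \{e_1, \dots, e_{d-1}\}$ be any orthonormal
basis for $\mathbb{R}^{d-1}$, let $r = (r_1, \dots, r_{d-1}) \in (0,\infty)^{d-1}$, and let $z = (\bar{x}, \bar{x}_*) \in \mathcal{I}$, that is,
$z \in \mathbb{R}^d \times \mathbb{R}^d$ satisfies $\bar{x}_d - (\bar{x}_*)_d = |\bar{x}' - \bar{x}_*'|^2$. The paraball $B = B(z, e, r, \rho)$ associated
to these data is the set of all $x \in \mathbb{R}^d$ satisfying

(3.3)  $\sum_{j=1}^{d-1} r_j^{-2}\,|\langle x' - \bar{x}',\, e_j \rangle|^2 < 1$,

(3.4)  $\big|\, x_d - (\bar{x}_*)_d - |x' - \bar{x}_*'|^2 \,\big| < \rho$.

Here $\bar{x} = (\bar{x}', \bar{x}_d) \in \mathbb{R}^d$ and $\bar{x}_* = (\bar{x}_*', (\bar{x}_*)_d) \in \mathbb{R}^d$.

For any $\lambda \ge 1$, the expanded paraball $\lambda B(z, e, r, \rho)$ is defined to be the set of all
$x \in \mathbb{R}^d$ satisfying

(3.5)  $\sum_{j=1}^{d-1} r_j^{-2}\,|\langle x' - \bar{x}',\, e_j \rangle|^2 < \lambda^2$,

(3.6)  $\big|\, x_d - (\bar{x}_*)_d - |x' - \bar{x}_*'|^2 \,\big| < \lambda\rho$.

The dual paraball $B^* = B^*(z, e, r, \rho)$ associated to these data is the set of all
$x_* = (x_*', (x_*)_d) \in \mathbb{R}^d$ satisfying

(3.7)  $\sum_{j=1}^{d-1} (r_j^*)^{-2}\,|\langle x_*' - \bar{x}_*',\, e_j \rangle|^2 < 1$,

(3.8)  $\big|\, (x_*)_d - \bar{x}_d + |x_*' - \bar{x}'|^2 \,\big| < \rho$

where $r_j r_j^* = \rho$ for every $j \in \{1, 2, \dots, d-1\}$.

$\mathcal{B}(z, e, r, \rho)$ denotes the ordered pair $\mathcal{B} = (B(z, e, r, \rho),\, B^*(z, e, r, \rho))$.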
$B$ uniquely determines $B^*$, and vice versa. The point $z = (\bar{x}, \bar{x}_*)$ is an element of
$\mathbb{R}^d \times \mathbb{R}^d$, not of $\mathbb{R}^d$. The data $(\bar{x}, e, r, \rho)$ do not suffice to completely determine $B$;
the point $\bar{x}$ can be regarded as a "center" of $B$, but the geometry of $B$ depends also
on $\bar{x}_*$.

The mapping from data $(z, e, r, \rho)$ to paraballs is not a one-to-one correspondence;
$O(d-1)$ acts naturally on the set of all orthonormal bases $e$, and if $e, \tilde{e}$ belong to
the same orbit under this action then $B(z, e, r, \rho) = B(z, \tilde{e}, r, \rho)$.

Given $\phi \in \mathcal{G}_d$ and a paraball $B$, $\{x : \phi(x) \in B\}$ is a paraball. For $\phi$ can be expressed
in the form $\phi(x', x_d) = v + (L(x'),\, \lambda x_d + Q(x'))$ where $L \in GL(d-1)$, $0 \ne \lambda \in \mathbb{R}$,
$v \in \mathbb{R}^d$, and $Q : \mathbb{R}^{d-1} \to \mathbb{R}^1$. Then $L^*L$ is a positive definite symmetric matrix,
hence admits a factorization $L^*L = O^{-1}DO$ with $O \in O(d-1)$ and $D$ a diagonal
matrix with positive diagonal entries. The condition $|L(x')|^2 < 1$ is equivalent to
$|\langle D^{1/2}O(x'),\, D^{1/2}O(x') \rangle| < 1$, which can be expressed in terms of $e, r$ where $e$ is the
image of the standard basis for $\mathbb{R}^{d-1}$ under $O$, and $(r_1, \dots, r_{d-1})$ are the diagonal
entries of $D^{1/2}$. Thus (3.3) transforms under $\phi$ to another inequality of the same
form. This is the main step in establishing the first conclusion of the next lemma.

Lemma 3.4. There are natural actions of $\mathcal{G}_d$ on the set of all paraballs, and of $\mathcal{G}_{d,d}$
on the set of all dual pairs of paraballs. These actions are transitive.

The simple remainder of the verification is left to the reader. 



3.3. Distance between paraballs. It will be useful to quantify the notion that
two paraballs are far apart. An approximate measure $\varrho(B^\sharp, B^\flat)$ of the discrepancy
between two paraballs is defined as follows.

To any paraball $B = B(z, e, r, \rho)$ is associated the balanced convex subset $C \subset \mathbb{R}^{d-1}$
defined by

$C = \{ v \in \mathbb{R}^{d-1} : \sum_{j=1}^{d-1} r_j^{-2}\,|\langle v, e_j \rangle|^2 < 1 \}$.

Much of the geometry of $B$ is encoded by $C$; $B$ is the set of all $(x', x_d)$ such that
$x' - \bar{x}' \in C$ and $|x_d - h(x')| < \rho$, for a certain quadratic polynomial $h$ specified by $\bar{x}_*$.

In the following definition, $C^\sharp, C^\flat$ denote the convex sets associated to $B^\sharp, B^\flat$,
respectively.
Definition 3.5. Let $B^\sharp = B(z^\sharp, e^\sharp, r^\sharp, \rho^\sharp)$ and $B^\flat = B(z^\flat, e^\flat, r^\flat, \rho^\flat)$ be two paraballs,
where $z^\sharp = (\bar{x}^\sharp, \bar{x}_*^\sharp)$, $z^\flat = (\bar{x}^\flat, \bar{x}_*^\flat)$ are two points in $\mathcal{I}$. Define

W'*) min(p*,^) + (rj) 2 + ^ ( r J)2 



3=1 K'j) 3=1 Vj' 



31 



+ ,^,M2 + L 



+ |4 |^-^|| + \x\-{x\) d -\x^'-x\'\ 



p5 p p 

In particular, $\varrho(B^\sharp, B^\flat) \ge 1$ and $\varrho(B^\sharp, B^\flat) = \varrho(B^\flat, B^\sharp)$ for any $B^\sharp, B^\flat$, and $\varrho(B, B) =
1$ for any $B$.

$\varrho$ is one of many ways in which the difference between two paraballs can be
quantified, and other variants would serve our purpose equally well. Of primary interest
here is the situation in which two paraballs differ markedly, rather than when they
nearly coincide. $\varrho$ is not actually a metric; for instance, $\varrho(B^\sharp, B^\flat) \ge 1$ for any $B^\sharp, B^\flat$.

The following two invariance properties are direct consequences of Definitions 3.3
and 3.5.

Lemma 3.6. For any $\phi \in \mathcal{G}_d$ and any two paraballs $B^\sharp, B^\flat$,

(3.9)  $\varrho(\phi(B^\sharp), \phi(B^\flat)) = \varrho(B^\sharp, B^\flat)$.

For any two dual pairs of paraballs $(B^\sharp, (B^\sharp)^*)$ and $(B^\flat, (B^\flat)^*)$,

(3.10)  $\varrho(B^\sharp, B^\flat) = \varrho((B^\sharp)^*, (B^\flat)^*)$.

The next lemma relates the distance between two paraballs to the relative size of 
their intersection. 

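Lemma 3.7. There exists a constant $C < \infty$ which depends only on the dimension
$d$, such that for any two paraballs $B^\sharp, B^\flat$,

(3.11)  $\varrho(B^\sharp, B^\flat) \le C \Big( \dfrac{\max(|B^\sharp|, |B^\flat|)}{|B^\sharp \cap B^\flat|} \Big)^{C}$.

There is of course no converse inequality; the right-hand side becomes infinite
whenever $B^\sharp \cap B^\flat = \emptyset$.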

The following elementary fact will be used in the proof: For any $d \ge 1$ there exist
constants $C, c \in (0, \infty)$ such that for any convex set $\mathcal{C} \subset \mathbb{R}^d$ of positive Lebesgue
measure, and any quadratic polynomial $Q : \mathbb{R}^d \to \mathbb{R}$ which does not vanish identically,
for any $\varepsilon > 0$,

$|\{ y \in \mathcal{C} : |Q(y)| \le \varepsilon \sup_{\mathcal{C}} |Q| \}| \le C\varepsilon^c\,|\mathcal{C}|$.
Proof of Lemma 3.7. We need to show that a lower bound on $\varrho(B^\sharp, B^\flat)$ implies an
upper bound on $|B^\sharp \cap B^\flat| / \max(|B^\sharp|, |B^\flat|)$ of inverse power law type. We may assume
that $\varrho(B^\sharp, B^\flat)$ is large, since otherwise the inequality holds automatically for large
enough $C$.

Recall the description of B b as the set of all (x',Xd) such that x' G C b , and \xd — 
h{x')\ < p, for a certain function h which depends on B 9 . This implies that 

\B i nB b \ < min(p»,p b )|C s nC b |. 
max(p«, pp) 

If ^ °(B^,B b ), this concludes the proof. In the same way, 



1 max(|C«|, \C 9 \) Vl M {J 

maxdC'l, 



The quantity ^^^jw] is comparable to the sum of the second and third terms in 

the definition of g(B$,B). The desired inequality follows, if either of these terms is 
> g(B^,B b ). Moreover 

d-l d-1 

|(#+C»)n(a; b +C b )| < C(^2 \(x*'-£',e§\/r) + ^ |(#-x b ', e b )|/r b ) max(|C«|, |C b |), 

j=i i=i 
so the conclusion holds if either the fourth or fifth term in the definition of g is largest. 

Consider next the final two terms in the definition. Define Q^(y) = yd — {%t)d ~~ 
\y' — x\'\ 2 andQ b (y) = yd — (x\)d— \y' — x b J\ 2 . Consider the case where the eighth term 
is large, in the sense that |Q"(x b )| > ~g(B$, -B b )p", and the first seven terms in the 
definition of g are all < co£?(-B", B 9 ) for a suitably small constant Co, and moreover, 

max(p«,p b )/min(p fl ,p b ) < g(B\ B b ) 1/2 . 

Then x b <£ BK We aim to prove that |-B"n.B b | is relatively small. To this end consider 
the quadratic polynomial P : R d_1 — > R defined by P(z) = Q^(z,t(z)) where t(z) is 
chosen so that Q 9 (z,t(z)) = 0, that is, t(z) = (x\)d + \z — x 9 J\ 2 . Observe that 

If \P(z)\ > p b + p tt then 5 s n B 9 n ({z} x R) = 0. 

In the case which we are now analyzing, P satisfies 

\P{x b )\>\g(B\B 9 )pK 



Define 



3max(p«,p b ) ^ _, ni D bx-i/2 



g(Bt,B*)pt 
On C\ 

\P(z)\ > eg(B\B 9 )p* = 3max(p tt ,p b ) > p tt + p b , 
for all $y$ outside a set of measure $\le C\varepsilon^c\,|C^\flat|$.
We conclude that 

|5»nfi b | < C7e c min(p«,p b )|C b | 
< C'q{B\B'')- c '\B^\ 

for some c',C G R + . This is the bound required. 

Everything is symmetric in the indices $\sharp, \flat$, so the case where the ninth term is
large requires no further discussion. Likewise it suffices now to treat the case where
the sixth term is large, the seventh term being handled by symmetry.

Suppose then that \(x\' — x b ',ef)| > cg(B$, B b )p^ fti for some index i. To simplify 
notation write u = x\' and v = x\' . If y = (w, t) G R d_1 x R belongs to B^ D _B b then 
t satisfies \t — (x\)d — \w — u\ 2 \ < p^ and \t — (x\)d — \w — v\ 2 \ < p b . Subtracting gives 

|2(w, u — v ) — s\ < 2 max(p", p b ) 

where s = 2\u\ 2 — 2u ■ v. The value of s is little consequence. With respect to the 
basis e", the z-th component of u — v has absolute value > cg(B$, B b )p^/ri. w ranges 
over a translate of C"; in particular, (w,el) satisfies the sole constraint (w, ef) < Tj. 
This forces 

\{w G x v + C i : \2{w,u-v) -s\< 2max(p fl ,p b )}| < Cq(B\ J B b )" 1 |C fl |, 

uniformly in all s6l. This implies the required bound, by a repetition of arguments 
given above. □ 

There is an adequate quasi-triangle inequality for $\varrho$.

Lemma 3.8. For any three paraballs $B^\sharp, B^\flat, B^\natural$,

(3.12)  $\varrho(B^\sharp, B^\flat) \le C\varrho(B^\sharp, B^\natural)^C + C\varrho(B^\natural, B^\flat)^C$

where $C < \infty$ depends only on the dimension $d$.

By exploiting the transitive action of $\mathcal{G}_d$ we may assume without loss of generality
that $B^\natural = B(z, e, r, \rho)$ where $z = (0, 0)$, $e$ is the standard basis for $\mathbb{R}^{d-1}$, $r =
(1, 1, \dots, 1)$ and $\rho = 1$. Reasoning like that in the proof of Lemma 3.7 then controls
the parameters specifying $B^\sharp$ in terms of $\eta^{-1} = \varrho(B^\sharp, B^\natural)$, and likewise for $B^\flat$. It
then follows directly from the definitions that $\varrho(B^\sharp, B^\flat) \le C\eta^{-C}$ where now $\eta^{-1}$ is the
maximum of the two quasidistances.

3.4. Lorentz spaces. 

Definition 3.9. Let $f$ be a nonnegative function which is finite almost everywhere.
A rough level set decomposition of $f$ is a representation $f = \sum_{j=-\infty}^{\infty} 2^j f_j$ where $f_j$
is supported on a set $E_j$, $1 \le |f_j(x)| < 2$ for almost every $x \in E_j$, and the sets $E_j$ are
pairwise disjoint and measurable.

Any (complex-valued) function $f$ which is finite almost everywhere on $\mathbb{R}^d$ admits such
a decomposition, which is unique modulo redefinition on sets of Lebesgue measure
zero. As shorthand for such a decomposition we will write "$f = \sum_j 2^j f_j$, $f_j \leftrightarrow E_j$".
A quasinorm for the Lorentz space $L^{p,r}(\mathbb{R}^d)$ is

$\|f\|_{p,r} = \Big( \sum_j \big( 2^j |E_j|^{1/p} \big)^r \Big)^{1/r}$

with the natural interpretation for $r = \infty$. For $p > 1$ and $r \in [1, \infty]$ there exist
equivalent expressions which satisfy the triangle inequality. These spaces are nested:
$L^{p,r} \subset L^{p,s}$ if $r \le s$, and the inclusion is proper if $r < s$. See [12] for further
information.
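To make the rough level set decomposition and the quasinorm concrete, here is a small sketch for a step function taking finitely many values on cells of equal measure. This discrete setting, and the names `rough_level_sets` and `lorentz_quasinorm`, are assumptions made only for this illustration; the quasinorm computed is $(\sum_j (2^j|E_j|^{1/p})^r)^{1/r}$, with the maximum over $j$ when $r = \infty$.

```python
import math
from collections import defaultdict


def rough_level_sets(values, cell_measure=1.0):
    """Group the positive values of a step function by dyadic level.

    Returns {j: (|E_j|, [f_j values])}, where 2**j <= value < 2**(j + 1)
    on E_j and f_j = value / 2**j lies in [1, 2).
    """
    levels = defaultdict(lambda: [0.0, []])
    for v in values:
        if v <= 0:
            continue  # zero values belong to no level set
        mantissa, exponent = math.frexp(v)  # v = mantissa * 2**exponent, 0.5 <= mantissa < 1
        j = exponent - 1
        levels[j][0] += cell_measure
        levels[j][1].append(2.0 * mantissa)
    return {j: (measure, fj) for j, (measure, fj) in levels.items()}


def lorentz_quasinorm(values, p, r, cell_measure=1.0):
    """Evaluate (sum_j (2**j * |E_j|**(1/p))**r)**(1/r), or the max over j if r is infinite."""
    levels = rough_level_sets(values, cell_measure)
    terms = [(2.0 ** j) * measure ** (1.0 / p) for j, (measure, _) in levels.items()]
    if not terms:
        return 0.0
    if math.isinf(r):
        return max(terms)
    return sum(t ** r for t in terms) ** (1.0 / r)
```

With $r = p$ the quantity is comparable to the $L^p$ norm (within a factor of 2, since $1 \le f_j < 2$), and for fixed $p$ it is nonincreasing in $r$, which is the nesting $L^{p,r} \subset L^{p,s}$ for $r \le s$ in this discrete model.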

3.5. Analytic preliminaries. We review here four facts established in [3]. The first
of these results asserts that any quasiextremal function for inequality (1.2) is closely
connected with the characteristic function of some paraball of comparable $L^p$ norm.

Lemma 3.10. [3] For any $\varepsilon > 0$ there exist $c, C \in \mathbb{R}^+$ with the following property. If
$f \in L^p$ is a nonnegative function with rough level set decomposition $f = \sum_{j \in \mathbb{Z}} 2^j f_j$,
$f_j \leftrightarrow E_j$, and if $\|Tf\|_q \ge \varepsilon\|f\|_p$ then there exist an index $j$ and a paraball $B$ such
that

$\|2^j f_j \cdot \chi_B\|_p \ge c\varepsilon^C \|f\|_p$

and

$|B| \le |E_j|$.

This is Theorem 1.5 of [3].

It is often useful to work with the bilinear form (g,Tf). The following result 
connects quasiextremal pairs (/, g) with dual pairs of paraballs, in the basic case 
when /, g are both characteristic functions of sets. 

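Proposition 3.11. [3] There exist positive finite constants $C, c, \gamma$, depending only on
the dimension $d$, with the following property. Let $E, E^*$ be measurable subsets of $\mathbb{R}^d$
satisfying $0 < |E|, |E^*| < \infty$. Define $\varepsilon > 0$ to be

$\varepsilon = \dfrac{\langle \chi_{E^*},\, T\chi_E \rangle}{|E|^{d/(d+1)}\,|E^*|^{d/(d+1)}}$.

Then there exists a pair $\mathcal{B} = (B, B^*)$ of dual paraballs such that

$|B| \le |E|$, $|E \cap B| \ge c\varepsilon^\gamma |E|$,

$|B^*| \le |E^*|$, $|E^* \cap B^*| \ge c\varepsilon^\gamma |E^*|$.

Under these hypotheses, Theorem 1.2 of [3] gives $|B| \le |E|$, $|B^*| \le |E^*|$, and
$\langle T(\chi_{E \cap B}),\, \chi_{E^* \cap B^*} \rangle \ge c\varepsilon^C \langle T(\chi_E),\, \chi_{E^*} \rangle = c\varepsilon^{C+1} |E|^{d/(d+1)} |E^*|^{d/(d+1)}$.

Since $\langle T(\chi_{E \cap B}),\, \chi_{E^* \cap B^*} \rangle \le A\,|E \cap B|^{d/(d+1)}\,|E^* \cap B^*|^{d/(d+1)}$, the lower bounds
$|E \cap B| \ge c\varepsilon^C |E|$ and $|E^* \cap B^*| \ge c\varepsilon^C |E^*|$ stated in Proposition 3.11 follow directly.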

Although the inequality $\|Tf\|_q \le A\|f\|_p$ is dilation-invariant and is the only $L^p(\mathbb{R}^d) \to
L^q(\mathbb{R}^d)$ inequality valid for $T$, it is nonetheless a suboptimal inequality within the
more general context of Lorentz spaces. The following is Theorem 1.6 of [3].

Proposition 3.12. $T$ maps $L^{p,r}$ boundedly to $L^q$ for all $p < r < q$.
The following was a principal ingredient in the proof of Proposition 3.12 and will
be needed again below. See Lemma 9.2 and inequality (9.10) of [3].

Lemma 3.13. For any $d \ge 2$ there exist $C, C' < \infty$ with the following property. Let
$E, E', F \subset \mathbb{R}^d$ be measurable sets with positive, finite measures. Let $\eta \in (0, A]$. If
$T\chi_E(x) \ge \eta\,|E|^{1/p}|F|^{-1+1/p}$ and $T\chi_{E'}(x) \ge \eta\,|E'|^{1/p}|F|^{-1+1/p}$ for every $x \in F$, then
$|E'| \le C'\eta^{-C}|E|$.

The exponents in the hypotheses are natural; the hypotheses imply for instance
that $\langle \chi_F,\, T\chi_E \rangle \ge \eta\,|F|^{1/p}|E|^{1/p}$.

4. Distant paraballs interact weakly 

The following lemma may at present seem unmotivated, but will later provide, in
the proof of Lemma 7.2, the geometric input for perhaps the most central step of our
analysis. By a partition of a set we will always mean an expression as a union of
pairwise disjoint subsets.

Lemma 4.1. For each $d \ge 2$ there exists $C < \infty$ with the following property. Let
$\eta \in (0, 1]$. Let $\{B_\alpha : \alpha \in S\}$ be an arbitrary finite collection of paraballs in $\mathbb{R}^d$
satisfying $\varrho(B_\alpha, B_\beta) \ge C\eta^{-C}$ whenever $\alpha \ne \beta$. Let $F \subset \mathbb{R}^d$ be a Lebesgue measurable
set of finite measure. Then $F$ can be measurably partitioned as $F = \bigcup_{\alpha \in S} F_\alpha$ in such
a way that

(4.1)  $\langle \chi_{F_\beta},\, T\chi_{B_\alpha} \rangle \le \eta\,|F|^{1/p}|B_\alpha|^{1/p}$ whenever $\alpha \ne \beta$.

Here $C$ depends only on the dimension $d$, not on $\eta$.
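Proof. Define

$\gamma_\beta = \tfrac13\,\eta\,|F|^{-1+1/p}|B_\beta|^{1/p}$

and

$\tilde{F}_\beta = \{ x \in F : T\chi_{B_\beta}(x) > \gamma_\beta \}$,

noting that

(4.2)  $\langle \chi_{F \setminus \tilde{F}_\alpha},\, T\chi_{B_\alpha} \rangle \le \int_{F \setminus \tilde{F}_\alpha} \gamma_\alpha \le \gamma_\alpha |F| = \tfrac13\,\eta\,|F|^{1/p}|B_\alpha|^{1/p}$.

Choose pairwise disjoint measurable sets $F_\beta \subset \tilde{F}_\beta$ so that $\bigcup_\beta F_\beta = \bigcup_\beta \tilde{F}_\beta$. Their
union is not necessarily all of $F$, but $F_0 = F \setminus \bigcup_\beta F_\beta$ already satisfies $\langle \chi_{F_0},\, T\chi_{B_\alpha} \rangle \le
\tfrac13\,\eta\,|F|^{1/p}|B_\alpha|^{1/p}$ for every $\alpha$ by (4.2), so it suffices to prove that for all $\alpha \ne \beta$,

(4.3)  $\langle \chi_{F_\beta},\, T\chi_{B_\alpha} \rangle \le \tfrac23\,\eta\,|F|^{1/p}|B_\alpha|^{1/p}$.

We prove (4.3) by contradiction. Suppose that there exist indices $\alpha \ne \beta \in S$ for
which (4.3) fails to hold. These indices will remain fixed for the remainder of this
proof. We aim to prove that $\varrho(B_\alpha, B_\beta)$ is small, contradicting the hypothesis.

Set $\mathcal{F} = F_\beta \cap \tilde{F}_\alpha$. Then $\langle \chi_{F_\beta \setminus \tilde{F}_\alpha},\, T\chi_{B_\alpha} \rangle \le \tfrac13\,\eta\,|F|^{1/p}|B_\alpha|^{1/p}$ as in (4.2), so

(4.4)  $\langle \chi_{\mathcal{F}},\, T\chi_{B_\alpha} \rangle = \langle \chi_{F_\beta} - \chi_{F_\beta \setminus \tilde{F}_\alpha},\, T\chi_{B_\alpha} \rangle > \tfrac13\,\eta\,|F|^{1/p}|B_\alpha|^{1/p}$.

Since $\langle \chi_{\mathcal{F}},\, T\chi_{B_\alpha} \rangle \le A\,|\mathcal{F}|^{1/p}|B_\alpha|^{1/p}$ by definition of $A$, this forces

$|\mathcal{F}| \ge 3^{-p}\eta^p A^{-p}|F|$.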

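Given $B_\alpha, B_\beta, \mathcal{F}$ as above, apply Proposition 3.11 with $E = B_\alpha$, $E^* = \mathcal{F}$ to obtain
a pair $\tilde{\mathcal{B}}_\alpha(z_\alpha, e_\alpha, r_\alpha, \rho_\alpha) = (\tilde{B}_\alpha, \tilde{B}_\alpha^*)$ satisfying

$|\tilde{B}_\alpha| \le |B_\alpha|$, $|\tilde{B}_\alpha^*| \le |\mathcal{F}| \le |F|$,

$|\tilde{B}_\alpha \cap B_\alpha| \ge c\eta^\gamma |B_\alpha|$, $|\tilde{B}_\alpha^* \cap \mathcal{F}| \ge c\eta^\gamma |\mathcal{F}| \ge c\eta^\gamma |F|$

for certain constants $c, \gamma$, whose values have changed from one occurrence to the next.

Set $\tilde{\mathcal{F}} = \mathcal{F} \cap \tilde{B}_\alpha^*$. We know already that $|\tilde{\mathcal{F}}| \ge c\eta^\gamma |F|$. Since $T\chi_{B_\beta}(x) > \gamma_\beta$ for
every $x \in \tilde{F}_\beta \supset \mathcal{F} \supset \tilde{\mathcal{F}}$,

$\langle \chi_{\tilde{\mathcal{F}}},\, T\chi_{B_\beta} \rangle \ge \gamma_\beta |\tilde{\mathcal{F}}| = \tfrac13\,\eta\,|\tilde{\mathcal{F}}| \cdot |F|^{-1+1/p}|B_\beta|^{1/p} \ge c\eta^\gamma |F|^{1/p}|B_\beta|^{1/p}$

for certain positive constants $c, \gamma$. Consequently Proposition 3.11 can be applied
again, this time with $E = B_\beta$ and $E^* = \tilde{\mathcal{F}}$, to obtain a pair $\tilde{\mathcal{B}}_\beta(z_\beta, e_\beta, r_\beta, \rho_\beta) =
(\tilde{B}_\beta, \tilde{B}_\beta^*)$ satisfying

$|\tilde{B}_\beta| \le |B_\beta|$, $|\tilde{B}_\beta^*| \le |\tilde{\mathcal{F}}| \le |F|$,

$|\tilde{B}_\beta \cap B_\beta| \ge c\eta^\gamma |B_\beta|$, $|\tilde{B}_\beta^* \cap \tilde{\mathcal{F}}| \ge c\eta^\gamma |\tilde{\mathcal{F}}| \ge c\eta^\gamma |F|$

for certain constants $c, \gamma$.

Since $\tilde{B}_\beta^* \cap \tilde{B}_\alpha^* \supset \tilde{B}_\beta^* \cap \tilde{B}_\alpha^* \cap \mathcal{F} = \tilde{B}_\beta^* \cap \tilde{\mathcal{F}}$,

$|\tilde{B}_\beta^* \cap \tilde{B}_\alpha^*| \ge |\tilde{B}_\beta^* \cap \tilde{\mathcal{F}}| \ge c\eta^\gamma |F| \ge c\eta^\gamma \max(|\tilde{B}_\alpha^*|, |\tilde{B}_\beta^*|)$.

By Lemmas 3.7 and 3.6 this implies that

$\varrho(\tilde{B}_\alpha, \tilde{B}_\beta) \le C\eta^{-C}$.

Since $|\tilde{B}_\alpha| \le |B_\alpha|$ and $|\tilde{B}_\beta| \le |B_\beta|$, while $|\tilde{B}_\alpha \cap B_\alpha| \ge c\eta^\gamma |B_\alpha|$ and $|\tilde{B}_\beta \cap B_\beta| \ge
c\eta^\gamma |B_\beta|$, one has

$\varrho(\tilde{B}_\beta, B_\beta) \le C\eta^{-C}$ and $\varrho(\tilde{B}_\alpha, B_\alpha) \le C\eta^{-C}$

by Lemma 3.7. Therefore by the quasi-triangle inequality of Lemma 3.8,

$\varrho(B_\alpha, B_\beta) \le C\eta^{-C}$.

This contradicts the assumption that $\varrho(B_\alpha, B_\beta)$ is sufficiently large. □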

5. Step 1: Entropy refinement 

According to Proposition 3.12, the inequality (1.2), while scale-invariant, is not
sharp within the scale of Lorentz spaces. From this lack of optimality there follows
useful information.

Lemma 5.1. For any $d \ge 2$ there exists $C < \infty$ with the following property. Let
$\varepsilon > 0$. Let $f$ be any nonnegative measurable function in $L^p(\mathbb{R}^d)$. Then there exist an
index set $S \subset \mathbb{Z}$ of cardinality $|S| \le C\varepsilon^{-C}$ and a function $\tilde{f}$ satisfying $0 \le \tilde{f} \le f$
with rough level set decomposition $\tilde{f} = \sum_{j \in S} 2^j f_j$, such that

(5.1)  $\|T\tilde{f}\|_q \ge (1 - \varepsilon)\,\|Tf\|_q$.
Proof. Choose any $r \in (p, q)$. Let $f$ have a rough level set decomposition $f =
\sum_{j \in \mathbb{Z}} 2^j f_j$, $f_j \leftrightarrow E_j$. Let $\eta > 0$ be a small parameter. Define

$S = \{ j : 2^j |E_j|^{1/p} > \eta \}$ and $\tilde{f} = \sum_{j \in S} 2^j f_j$.

Then

$\|f - \tilde{f}\|_{p,r}^r = \sum_{j \notin S} (2^j |E_j|^{1/p})^r = \sum_{j \notin S} (2^j |E_j|^{1/p})^p\,(2^j |E_j|^{1/p})^{r-p}$

$\le \eta^{r-p} \sum_j (2^j |E_j|^{1/p})^p \le \eta^{r-p}\,\|f\|_p^p$

by Hölder's inequality. Therefore

$\|T(f - \tilde{f})\|_q \le C\,\|f - \tilde{f}\|_{p,r} \le C\eta^{1 - p/r}\,\|f\|_p^{p/r}$

where $C < \infty$ is the norm of $T$ as an operator from $L^{p,r}$ to $L^q$. Moreover,

$\eta^p |S| = \sum_{j \in S} \eta^p \le \sum_j 2^{jp} |E_j| \le \|f\|_p^p$.

We may assume without loss of generality that $\|f\|_p = 1$. Then defining $\eta$ to satisfy
$C\eta^{1 - p/r} = \varepsilon$ gives the conclusion stated. □

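Lemma 5.2. Let $f \ge 0$ satisfy $\|Tf\|_q \ge (1 - \delta)A\|f\|_p$. Then the function $\tilde{f}$ in
Lemma 5.1 can be chosen to satisfy

(5.2)  $\|f - \tilde{f}\|_p \le C(\varepsilon + \delta)^{1/p}\,\|f\|_p$

in addition to all the conclusions of Lemma 5.1.
Proof. Construct $\tilde{f}$ as in the proof of Lemma 5.1. Since

$\|T\tilde{f}\|_q \le A\,\|\tilde{f}\|_p$

and

$\|T\tilde{f}\|_q \ge (1 - \varepsilon)\,\|Tf\|_q$,

it follows that

$\|\tilde{f}\|_p \ge A^{-1}(1 - \varepsilon)\,\|Tf\|_q$.

Since $\tilde{f}$, $f - \tilde{f}$ have disjoint supports,

$\|f - \tilde{f}\|_p^p = \|f\|_p^p - \|\tilde{f}\|_p^p$

$\le \|f\|_p^p - (1 - \varepsilon)^p A^{-p}\,\|Tf\|_q^p$

$\le \|f\|_p^p - (1 - \varepsilon)^p A^{-p}(1 - \delta)^p A^p\,\|f\|_p^p$

$= [\,1 - (1 - \varepsilon)^p (1 - \delta)^p\,]\,\|f\|_p^p$

$\le (C\varepsilon + C\delta)\,\|f\|_p^p$.

□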
The upshot is that near-extremals have low entropy, in the sense that relatively
few terms in their rough level set decompositions suffice to approximate them to a
specified degree of accuracy.

We have implicitly also established the following variant of Lemma 5.2.

Lemma 5.3. There exist $c, C \in \mathbb{R}^+$ with the following property. Suppose that
$0 \le f \in L^p$, and let $f$ satisfy $\|Tf\|_q \ge (1 - \delta)A\|f\|_p$, and have rough level set
decomposition $f = \sum_j 2^j f_j$, $f_j \leftrightarrow E_j$. Then for any $\eta \in (0, 1]$,

$\Big\| \sum_{j\,:\,2^j |E_j|^{1/p} < \eta \|f\|_p} 2^j f_j \Big\|_p \le C(\delta^{1/p} + \eta^c)\,\|f\|_p$.

6. Step 2: Weak higher integrability 

In Lemma 5.1 we obtained a very weak form of precompactness, in the form of an
a priori bound on the cardinality of the index set $S$ in the sum $\tilde{f} = \sum_{j \in S} 2^j f_j$. The
next step is to show that the indices in $S$ cannot be far apart from one another.

Lemma 6.1. There exist constants $c, C, C' \in (0, \infty)$, depending only on the dimension
$d$, with the following property. Let $\rho \in (0, 1)$. Let $f$ be a $(1 - \delta)$-quasiextremal for
(1.2) satisfying $\|f\|_p = 1$. If $\delta \le c\rho^C$, then there exists a function $\tilde{f}$ satisfying
$\|f - \tilde{f}\|_p \le C\rho^c$ with a rough level set decomposition $\tilde{f} = \sum_j 2^j f_j$ such that if both
$\|2^i f_i\|_p \ge \rho$ and $\|2^j f_j\|_p \ge \rho$, then

$|i - j| \le C'\rho^{-C'}$.

Proof. By Lemma 5.1 there exists a function $\tilde f$ with rough level set decomposition
$\tilde f = \sum_{j\in S} 2^j f_j$, $f_j \leftrightarrow E_j$, with $|S| \le C\rho^{-C}$ such that $\|f - \tilde f\|_p \le C\rho^{c}$, and moreover

\[ \|2^j f_j\|_p \ge \rho \quad\text{for every } j \in S. \]

The operator dual to $T$ is identical to $T$ under conjugation by a simple change
of variables. Thus Lemma 5.1 applies equally well to it. Therefore by duality, there
exists $h \in L^p$ satisfying $\|h\|_p = 1$, with rough level set decomposition $h = \sum_{k\in\tilde S} 2^k h_k$,
$h_k \leftrightarrow F_k$, with index set $\tilde S \subset \mathbb{Z}$ satisfying $|\tilde S| \le C\rho^{-C}$, so that

\[ \langle h, T\tilde f\rangle \ge (1-2\delta)A. \]

Set $N = |S| + |\tilde S| \le C\rho^{-C}$.



Set

\[ M = \max_{i,j\in S} |i-j|. \]

It is possible to partition $S$ into two nonempty disjoint sets $S = S^\sharp \cup S^\flat$ so that
$|i-j| \ge M/N$ whenever $i \in S^\sharp$ and $j \in S^\flat$. Fix any such partition.
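
The partition step is a pigeonhole argument: among the gaps between consecutive elements of $S$, the largest is at least $M/(|S|-1) \ge M/N$, and cutting $S$ at that gap separates the two halves by at least that amount. A minimal sketch, assuming $S$ is given as a finite collection of integers; the helper name `split_at_largest_gap` is hypothetical and not taken from the paper.

```python
def split_at_largest_gap(indices):
    """Split a finite set of integers at its largest gap.

    Returns (lower, upper, gap), where every element of `upper` exceeds every
    element of `lower` by at least `gap`, and gap >= diameter / (size - 1).
    """
    s = sorted(set(indices))
    if len(s) < 2:
        raise ValueError("need at least two distinct indices")
    # Gaps between consecutive elements; they sum to the diameter max - min.
    gaps = [b - a for a, b in zip(s, s[1:])]
    cut = max(range(len(gaps)), key=gaps.__getitem__)
    return s[: cut + 1], s[cut + 1 :], gaps[cut]
```

Since the gaps sum to the diameter $M$, the largest one cannot be smaller than their average, which is the only fact the proof uses.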

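Let $\eta > 0$ be a small parameter to be chosen below; it will depend on $N$ and thereby
ultimately on $\rho$. Partition each of the sets $F_k$ as $F_k = F_k^\sharp \cup F_k^\flat \cup F_k^\natural$ measurably, so
that

(i) For each $x \in F_k^\sharp$ there exists $j \in S^\sharp$ such that $T\chi_{E_j}(x) \ge \eta|F_k|^{-1+1/p}|E_j|^{1/p}$;

(ii) For each $x \in F_k^\flat$ there exists $j \in S^\flat$ such that $T\chi_{E_j}(x) \ge \eta|F_k|^{-1+1/p}|E_j|^{1/p}$;

(iii) For each $x \in F_k^\natural$, $T\chi_{E_j}(x) < \eta|F_k|^{-1+1/p}|E_j|^{1/p}$ for every $j \in S$.

Set

\[ h^\flat = \sum_{k\in\tilde S} 2^k h_k \chi_{F_k^\flat}, \qquad h^\sharp = \sum_{k\in\tilde S} 2^k h_k \chi_{F_k^\sharp}, \qquad f^\flat = \sum_{j\in S^\flat} 2^j f_j, \qquad f^\sharp = \sum_{j\in S^\sharp} 2^j f_j. \]

For any $j \in S$ and $k \in \tilde S$,

\[ \begin{aligned}
\langle 2^k h_k \chi_{F_k^\natural},\, T(2^j f_j)\rangle &\le 2^{k+j+2}\,\langle \chi_{F_k^\natural},\, T\chi_{E_j}\rangle \\
&\le 2^{k+j+2}\,\eta\,|F_k^\natural| \cdot |F_k|^{-1+1/p}|E_j|^{1/p} \\
&\le 2^{k+j+2}\,\eta\,|F_k|^{1/p}|E_j|^{1/p} \\
&\le 4\eta\,\|h\|_p\|f\|_p \\
&= 4\eta.
\end{aligned} \]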

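$\tilde f = f^\sharp + f^\flat$ where $f^\sharp, f^\flat$ have disjoint supports; likewise $h = h^\sharp + h^\flat + h^\natural$ with
disjointly supported summands. Consequently, as in the proof of Lemma 5.1,

\[ \langle h^\sharp, Tf^\sharp\rangle + \langle h^\flat, Tf^\flat\rangle \le A(1 - C\rho^{c})\|f\|_p\|h\|_p = A(1 - C\rho^{c}) \]

since $\|f^\sharp\|_p \ge \rho$, $\|f^\flat\|_p \ge \rho$, $1 = \|h\|_p^p \ge \|h^\sharp\|_p^p + \|h^\flat\|_p^p$, and $1 = \|f\|_p^p \ge \|\tilde f\|_p^p =
\|f^\sharp\|_p^p + \|f^\flat\|_p^p$. Thus

\[ \begin{aligned}
\langle h, T\tilde f\rangle &\le CN^2\eta + \langle h^\sharp, Tf^\sharp\rangle + \langle h^\flat, Tf^\flat\rangle + \langle h^\sharp, Tf^\flat\rangle + \langle h^\flat, Tf^\sharp\rangle \\
&\le CN^2\eta + A(1 - C\rho^{c}) + \langle h^\sharp, Tf^\flat\rangle + \langle h^\flat, Tf^\sharp\rangle.
\end{aligned} \]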

Set

\[ \eta = c_0\rho^{C_0}, \tag{6.1} \]

where $c_0, C_0$ are respectively sufficiently small and sufficiently large constants. In
particular, choose $c_0, C_0$ so that $CN^2\eta + (A - C\rho^{c}) < (1-3\delta)A$; this is possible since
$N \le C\rho^{-C}$. Thus

\[ \langle h^\flat, Tf^\sharp\rangle \ge c\rho^{C}, \tag{6.2} \]

or

\[ \langle h^\sharp, Tf^\flat\rangle \ge c\rho^{C}. \]

It is no loss of generality to assume (6.2), since the situation is symmetric with respect
to interchange of the indices $\sharp$, $\flat$.

There must exist $k \in \tilde S$ and $j \in S^\sharp$ such that

(6-3) (x^,^.}^^!^! 1 ^!^-! 1 ^ 

otherwise we would have a total bound of $C\eta N^2$ for the left-hand side in (6.2), which
would be a contradiction due to the choice of $\eta$.
Let 

= {xe Ft : T XSj (x) < l V \FinF k \^\E^}. 
Then

so 

Since 

we must have 
Thus 



(x,*.t,TxB,>< jT/|F*|Vi'|^ i |VP | 

k 

(X F ^^T XE] ) >\ri\F k \ llp \Ej\ llp . 

k v k 

, t ,T X£j )<A|F fc b \F^| 1/p |^r /p , 



n\^\>cp u \F h \. 



for every x G \ F^. 

There exist an index i E S b and a subset J 7 C F^\ F k '* such that Tx^(x) > 
ri\F k \~ l+l l p \Ei\ l l p for all x G J 7 and 

k v fc 

Since (x^Txe,) < A| J r | 1 / p |E j | 1 / p , it follows again that (J 7 ) > cp c |F fe |. 
Therefore 



1^1 


-l+i/p 




|i/p 




-i+i/p 







for every £ G J 7 . By Lemma 13.131 this forces 



1. 



< Cp -0 !^! and l^-l < Cp -0 !^). 

Since > p, pf < 2 p 2 pi \Ei\. On the other hand, 2 p ^\E j \ < 

Therefore 

|J5,-| < C2 (i - j)p p- p \Ei\. 

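In conjunction with the reverse bound $|E_j| \ge c\rho^{C}|E_i|$ proved above, this forces
$|i - j| \le C\log(1/\rho)$.

This conclusion holds for a certain pair of indices, one belonging to $S^\sharp$ and the other to $S^\flat$. The partition $S = S^\sharp \cup S^\flat$
was chosen so that $M \le N|i-j|$ for any such pair. Therefore $M \le C\rho^{-C}$. □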

Remark 6.2. This proof is on track to establish higher integrability in the natural
form $\phi^* f \in L^Q$ for some $Q > p$ for extremals, until the very last step, in which $|i-j|$
turns into $N|i-j|$. Perhaps some more efficient reorganization is possible.

Corollary 6.3. There exist a finite constant $C$ and a function $\Psi : (0,\infty) \to (0,\infty)$
satisfying $\Psi(t)/t^p \to \infty$ as $t \to \infty$ and as $t \to 0$, with the following property. For any
$\varepsilon > 0$ there exists $\delta > 0$ such that for any nonnegative function $f$ satisfying $\|f\|_p = 1$
and $\|Tf\|_q \ge (1-\delta)A$, there exist $\phi \in \mathfrak{G}_d$ and a decomposition $\phi^* f = g + h$ such that
$g, h \ge 0$ satisfy $\|h\|_p < \varepsilon$ and

\[ \int \Psi(g) \le C. \tag{6.4} \]

Proof. Fix any exponent $r \in (p,q)$. Let $\eta > 0$ be a small quantity to be chosen at
the end of the proof. Let $\delta$ be a sufficiently small function of $\eta, \varepsilon$.

Let $f$ have a rough level set decomposition $f = \sum_{j\in\mathbb{Z}} 2^j f_j$, $f_j \leftrightarrow E_j$. By Lemma 5.3,
if $\delta > 0$ is sufficiently small then there exists at least one index $k$ such that $\|2^k f_k\|_p \ge
c_0$, where $c_0 > 0$ is an absolute constant which depends only on the dimension $d$. By
choosing $\phi \in \mathfrak{G}_d$ to be an appropriate dilation symmetry $\phi(x', x_d) = (rx', r^2 x_d)$ we
may reduce to the case where $k = 0$.

By Lemma 6.1 there exists $M < \infty$ such that $\|2^j f_j\|_p < \eta$ whenever $|j| > M$.

Define $h = \sum_{|j|>M} 2^j f_j$, and $g = \sum_{|j|\le M} 2^j f_j$. Then $\|h\|_{p,r}^r = \sum_{|j|>M} \|2^j f_j\|_p^r \le
\eta^{r-p}\sum_j \|2^j f_j\|_p^p \le \eta^{r-p}$. By the proof of Lemma 5.2, this implies that $\|h\|_p < \varepsilon$
provided that $\delta, \eta$ are chosen to be sufficiently small.

If $\eta, \delta$ are chosen to be sufficiently small then by Lemma 6.1 for any $\rho \ge \eta$,
$|j| \le C\rho^{-C}$ whenever $\|2^j f_j\|_p \ge \rho$. Since the sets $E_j$ are pairwise disjoint, for any
nondecreasing function $\Psi$, $\int \Psi(g) = \sum_{|j|\le M} \int_{E_j} \Psi(2^j f_j) \le \sum_{|j|\le M} \Psi(2^{j+1})|E_j|$.

Let $S_k = \{j : |j| \le M \text{ and } \|2^j f_j\|_p \in (2^{-k-1}, 2^{-k}]\}$. Then for all $j \in S_k$, $|j| \le C2^{Ck}$
by Lemma 6.1. Therefore

je5 fc jeS k 



Moreover, 



— ck 



by Lemma [5.31 Thus 



£ $>(2^)|^| < £ c7(5 + 2- 

2k< ri -ij<=S k 2 k <rj- 1 



max 



Choose $\Psi$ to be a nondecreasing function satisfying the growth condition

> 2 c/c max ^7---^ < oo, 



fc=0 

and $\Psi(t)/t^p \to \infty$ as $t \to 0$ and as $t \to \infty$. Then choose $\delta$ to be a function of $\eta$,
satisfying

> max — t— > < 1. 

t—l \j\<C2Ck 2p(J + 1 ) ~~ 

This completes the proof. □ 



7. Step 3: Spatial localization 

Lemma 3.10 leads to the following preliminary result concerning the geometric
structure of quasiextremals.

Lemma 7.1. For any $\varepsilon > 0$ there exist $\delta > 0$ and $N, K < \infty$ with the following
property. Let $f \ge 0$ be any $(1-\delta)$-quasiextremal for inequality (1.2) satisfying
$\|f\|_p = 1$. Then there exists a function $F$ with a rough level set decomposition
$F = \sum_{j\in S} 2^j F_j$, $F_j \leftrightarrow E_j$, satisfying

\[ 0 \le F \le f, \]
\[ \|TF\|_q \ge (1-\varepsilon)A, \]
\[ |i - j| \le K \quad\text{for all } i,j \in S, \]

and for each $j \in S$ there exist $N$ paraballs $B_{j,i}$ such that

\[ E_j \subset \bigcup_{i=1}^{N} B_{j,i}, \tag{7.1} \]

\[ \sum_i |B_{j,i}| \le C(\varepsilon)\,|E_j|. \tag{7.2} \]

Proof. In light of results already proved, by modifying $f$ by a function whose $L^p$
norm is $< \varepsilon/2$, we may suppose that $f$ has a finite rough level set decomposition
$f = \sum_{j\in S} 2^j f_j$, $f_j \leftrightarrow E_j$, with $|i-j| \le K(\varepsilon)$ for all $i,j \in S$. Let $\eta > 0$ be a small
quantity. Then $\|Tf\|_q \ge (1-\delta)A \ge \eta$, so we may apply Lemma 3.10 to find a
paraball $B$ and an index $i_1$ such that $\|2^{i_1} f_{i_1}\chi_B\|_p \ge \rho$ and $|B| \le |E_{i_1}|$, where $\rho > 0$
depends only on $\eta$. Set $g_1 = 2^{i_1} f_{i_1}\chi_B$ and write $f = g_1 + h_1$.

This was step 1 of a construction which we iterate, as follows. At step $n$, one is
given a collection of paraballs $\{B_m : m \le n-1\}$, along with a decomposition

\[ f = \sum_{m=1}^{n-1} g_m + h_{n-1} \]

such that the sets $\tilde B_m = B_m \setminus \bigcup_{i<m} B_i$ and functions $g_m$, $h_{n-1}$ satisfy

\[ g_m = 2^{i_m} f_{i_m}\chi_{\tilde B_m} \quad\text{for some } i_m \in \mathbb{Z}, \]
\[ |B_m| \le |E_{i_m}|, \]
\[ h_{n-1} = f \cdot \chi_{\mathbb{R}^d \setminus \bigcup_{m<n} B_m}. \]

Since $0 \le h_{n-1} \le f$, $\|h_{n-1}\|_p \le \|f\|_p \le 1$. If $\|Th_{n-1}\|_q < \eta$ then the construction
terminates. Otherwise invoke Lemma 3.10 to find a paraball $B_n$ and an index $i_n$ such
that the function

\[ g_n = 2^{i_n} f_{i_n} \cdot \chi_{\mathbb{R}^d \setminus \bigcup_{m<n} B_m} \cdot \chi_{B_n} \]

satisfies $\|g_n\|_p \ge \rho$ and $|B_n| \le |E_{i_n}|$.

The functions $g_n$ are nonnegative and have pairwise disjoint supports, and $\sum_{m=1}^{n} g_m \le
f$. Consequently $\sum_{m=1}^{n} \|g_m\|_p^p \le \|f\|_p^p \le 1$, so this process must terminate after at
most $\rho^{-p}$ iterations. If it terminates at the $n$-th step, then set $F = \sum_{m=1}^{n-1} g_m$. Then
$\|T(f - F)\|_q < \eta$, so by Lemma 5.2, $\|f - F\|_p < \tfrac12\varepsilon$ provided that $\eta$ is chosen to be
sufficiently small. Defining the collection $\{B_{j,i}\}$ to be $\{B_m : i_m = j\}$ for each index
$j$ produces a collection of paraballs satisfying (7.1), (7.2). □
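
The bound on the number of iterations in the proof of Lemma 7.1 can be written out in one line. This is only a sketch of the counting step, using the three facts stated in the proof: the $g_m$ have pairwise disjoint supports, each satisfies $\|g_m\|_p \ge \rho$, and their sum is dominated by $f$ with $\|f\|_p \le 1$.

```latex
n\rho^p \;\le\; \sum_{m=1}^{n} \|g_m\|_p^p
        \;=\; \Big\| \sum_{m=1}^{n} g_m \Big\|_p^p
        \;\le\; \|f\|_p^p \;\le\; 1
\quad\Longrightarrow\quad n \le \rho^{-p}.
```

The middle equality is where disjointness of the supports enters; without it one would only have the triangle inequality, which gives no bound on $n$.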

Consider momentarily the possibility of a sequence $\{f_\nu\}$ of quasiextremals which
are characteristic functions of sets $E_\nu$ satisfying $|E_\nu| = 1$. If these sets were to move
off to spatial infinity as $\nu \to \infty$, then $f_\nu$ and $f_\nu^p$ would converge weakly to zero,
preventing the extraction of an extremal as a limit of some subsequence. If each $E_\nu$
were a paraball, then this situation could be rectified by invoking the symmetry group
$\mathfrak{G}_d$ to replace each $E_\nu$ by a paraball independent of $\nu$. However, there is potentially
a more problematic obstruction: If each $E_\nu$ were a disjoint union $E_\nu = E'_\nu \cup E''_\nu$,
with $|E'_\nu| = |E''_\nu| = \tfrac12$ and with $E'_\nu, E''_\nu$ moving to infinity in different directions, then
the symmetries would not suffice to produce a useful renormalized sequence. The
following refinement of Lemma 7.1 rules out this sort of obstruction.

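Lemma 7.2. For any $\varepsilon > 0$ there exist $\delta > 0$ and $K, \lambda < \infty$ with the following
properties. Let $f \ge 0$ be any $(1-\delta)$-quasiextremal for inequality (1.2) satisfying
$\|f\|_p = 1$. Then there exist a function $\tilde f$ and a paraball $B$ such that

\[ 0 \le \tilde f \le f, \]
\[ \|\tilde f\|_p \ge 1 - \varepsilon, \]
\[ \|T\tilde f\|_q \ge (1-\varepsilon)A, \]

and $\tilde f$ admits a rough level set decomposition $\tilde f = \sum_{j\in S} 2^j f_j$ with a distinguished
index $J \in \mathbb{Z}$ satisfying

\[ |j - J| \le K \quad\text{for all } j \in S, \]
\[ f_j \text{ is supported in } \lambda B \text{ for all } j \in S, \]
\[ 2^J |B|^{1/p} \le C\|f\|_p. \]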

This improves upon Lemma 7.1 in that the collection of paraballs $B_{j,i}$ in the
conclusion has been replaced by a single paraball, which however must be expanded by
the factor $\lambda$. This is a geometric analogue of the replacement of an upper bound on
the cardinality of the index set $S$ in Lemma 5.2 by an upper bound on its diameter
in Lemma 6.1. Lemma 7.2 follows directly from the combination of Lemmas 4.1
and 7.1 in the same way that Lemma 6.1 followed from Lemma 5.2 combined with
Lemma 3.13. Details are left to the reader.

8. Weak Convergence and Extremizers 

Let $f$ be a $(1-\delta)$-quasiextremal satisfying $\|f\|_p = 1$ for $\delta$ sufficiently small. Apply
Lemma 7.2. Then by replacing $f$ by $\phi^* f$ for an appropriately chosen $\phi \in \mathfrak{G}_d$, we may
reduce to the case where in the conclusions of that lemma, $J = 0$ and the paraball $B$
is $B = \{x \in \mathbb{R}^d : |x_j| < 1 \text{ for all } 1 \le j \le d-1 \text{ and } |x_d - |x'|^2| < 1\}$. Thus we have
the following information.

Lemma 8.1. There exist a constant $C < \infty$ and functions $\Psi : [0,\infty) \to [0,\infty)$ and
$\rho : [1,\infty) \to (0,\infty)$ satisfying $\Psi(t) \ge t^p$ for all $t$, $\Psi(t)/t^p \to \infty$ as $t \to 0$ and as
$t \to \infty$, and $\rho(R) \to 0$ as $R \to \infty$, such that for any $\varepsilon > 0$ there exists $\delta > 0$ with
the following property. Let $f$ be any nonnegative function satisfying $\|f\|_p = 1$ and
$\|Tf\|_q \ge (1-\delta)A$. Then there exist $\phi \in \mathfrak{G}_d$ and a decomposition

\[ \phi^* f = g + h \]

with $g, h \ge 0$ satisfying

\[ \|h\|_p < \varepsilon, \qquad \int_{\mathbb{R}^d} \Psi(g) \le C, \qquad \int_{|x|\ge R} g^p \le \rho(R). \]

Moreover, there exists a nonnegative function $F$ satisfying $\|F\|_p = 1$ and

\[ \langle F, Tg\rangle \ge (1-\varepsilon)A, \qquad \int_{\mathbb{R}^d} \Psi(F) \le C, \qquad \int_{|x|\ge R} F^p \le \rho(R). \]

The bound for $\int \Psi(g)$ is Corollary 6.3. The bound for $\int_{|x|\ge R} g^p$ follows in a similar
way from Lemma 7.2.

With these uniform bounds in hand, it is straightforward to derive most of the
conclusions of Theorem 2.2. For any sequence of functions $\{f_\nu\}$, we write $f_\nu \rightharpoonup f$ to
mean that $\int f_\nu\varphi \to \int f\varphi$ as $\nu \to \infty$, for every compactly supported continuous test
function $\varphi$. If $f_\nu$ are nonnegative $L^1$ functions, then $f_\nu \rightharpoonup f$ implies $\|f_\nu\|_1 \to \|f\|_1$,
provided that $\sup_\nu \int_{|x|\ge R} f_\nu \to 0$ as $R \to \infty$.

Proof of existence of extremizers. Let $\{f_\nu\}$ be any extremizing sequence. Then by
the preceding lemma, there exist $\phi_\nu \in \mathfrak{G}_d$ such that $\phi_\nu^* f_\nu = g_\nu + h_\nu$ where $\|h_\nu\|_p \to 0$,
while the functions $g_\nu$ satisfy the other conclusions of Lemma 8.1, and there exist
corresponding functions $F_\nu$ satisfying the same bounds as $g_\nu$, with $\|F_\nu\|_p = 1$ and

\[ \langle F_\nu, Tg_\nu\rangle \to A. \]

It follows directly from the Banach-Alaoglu theorem that after passage to a
subsequence of the index $\nu$, $F_\nu^p \rightharpoonup F^p$ and $g_\nu^p \rightharpoonup g^p$ for some $F, g \in L^p$. Since $\int F^p\eta \le 1$
for every continuous, compactly supported function $\eta$ satisfying $\|\eta\|_\infty \le 1$, necessarily
$\int F^p \le 1$; likewise $\int g^p \le 1$.

We claim that

\[ \langle F_\nu, Tg_\nu\rangle \to \langle F, Tg\rangle. \tag{8.1} \]

Since $\|F\|_p \le 1$ and likewise $\|g\|_p \le 1$ and $\langle F_\nu, Tg_\nu\rangle \to A$, it follows that $\|F\|_p =
\|g\|_p = 1$ and that $g$ is an extremal.
To prove (8.1) define

\[ g_{\nu,\lambda}(x) = g_\nu(x)\,\chi_{|x|\le\lambda}(x)\,\chi_{g_\nu(x)\le\lambda}(x), \]

and define $g_\lambda$, $F_{\nu,\lambda}$, $F_\lambda$ in the corresponding way. For any compactly supported
function $\eta \in C_0(\mathbb{R}^d)$, the operator $f \mapsto \eta T(\eta f)$ is smoothing in the sense that it
maps $L^2(\mathbb{R}^d)$ boundedly to the Sobolev space $H^s(\mathbb{R}^d)$ for $s = (d-1)/2$. $H^s$ embeds
compactly into $L^2$ in any bounded region. Therefore the weak $L^p$ convergence of
$g_{\nu,\lambda}$ to $g_\lambda$ as $\nu \to \infty$ implies $L^2$ norm convergence of $T(g_{\nu,\lambda})$ to $T(g_\lambda)$ as $\nu \to \infty$,
for every fixed $\lambda$. Therefore $\langle F_{\nu,\lambda}, Tg_{\nu,\lambda}\rangle \to \langle F_\lambda, Tg_\lambda\rangle$ as $\nu \to \infty$, for every fixed
$\lambda < \infty$.

The conclusions of Lemma 8.1 guarantee that $g_{\nu,\lambda} \to g_\nu$ and $F_{\nu,\lambda} \to F_\nu$ in $L^p$ norm
uniformly in $\nu$ as $\lambda \to \infty$. This uniform convergence, together with the convergence
proved in the preceding paragraph, gives (8.1). □



$L^p$ norm convergence of $g_\nu$ to $g$ will be proved below.
\ d q = A d+1 \\f\\ d p +1 .



Proof of part (iv) of Theorem 2.2. Let $f$ be an arbitrary nonnegative extremizer
satisfying $\|f\|_p = 1$. For any $n$ there exist $\phi_n \in \mathfrak{G}_d$ and a decomposition $\phi_n^* f = g_n + h_n$
where $g_n$ satisfies the inequalities of Lemma 8.1, and $\|h_n\|_p \le 2^{-n}$. By passing to
a subsequence, we may arrange that $g_n$ converges in $L^p$ and almost everywhere to
a limit $g$, which is an extremizer. Since $\phi_n^*$ preserves the $L^p$ norm and $\|h_n\|_p \to 0$,
$\phi_n^* f \to g$ and $(\phi_n^{-1})^* g \to f$ in $L^p$ norm. It is an elementary consequence of the
concrete description of $\mathfrak{G}_d$ that this forces $\{\phi_n^{-1}\}$ to be a precompact family of
diffeomorphisms of $\mathbb{R}^d$. By passing to a subsequence we conclude that $f = \phi^* g$ for
some $\phi \in \mathfrak{G}_d$. Thus $f$ satisfies all of the conclusions stated in conclusion (iv) of
Theorem 2.2. □

9. Loose ends 

Proof of the Euler-Lagrange identity (2.7). Let $f$ be any nonnegative extremizer
satisfying $\|f\|_p = 1$. Since $q = d+1$ is the exponent conjugate to $p = (d+1)/d$, $T^*$ has
the same norm as $T$, as operators from $L^p$ to $L^q$. Then

\[ \begin{aligned}
A^{d+1}\|f\|_p^{d+1} &= \langle Tf, (Tf)^d\rangle = \langle f, T^*([Tf]^d)\rangle \\
&\le \|f\|_p\,\|T^*(Tf)^d\|_q \le A\|f\|_p\,\|(Tf)^d\|_p \\
&= A\|f\|_p\,\|Tf\|_q^d = A^{d+1}\|f\|_p^{d+1}.
\end{aligned} \]

The first inequality is an application of Hölder's inequality. Since the extreme left-
and right-hand sides are equal, both inequalities in this chain must be equalities.
Equality in Hölder's inequality forces $T^*([Tf]^d)$ to agree almost everywhere with
some constant multiple of $f^{p-1} = f^{1/d}$. The chain of equalities then forces this constant to
be $A^{d+1}$. □

Lemma 9.1. If $f$ is a nonnegative extremizer for (1.2), then $f(x) > 0$ for almost
every $x \in \mathbb{R}^d$.

Proof. Let $f$ be a nonnegative extremizer satisfying $\|f\|_p = 1$. Consider initially the
set $G$ consisting of all points expressible in the form $z + (s', |s'|^2) - (t', |t'|^2)$ such that
$z$ is a Lebesgue point of $\{y : f(y) > 0\}$ and $s' \ne t' \in \mathbb{R}^{d-1}$. We claim that $f(x) > 0$
for almost every $x \in G$. If not, choose some subset $\tilde G \subset G$ on which $f$ vanishes, satisfying $0 < |\tilde G| < \infty$.
Consider the functions $f_\varepsilon = f + \varepsilon\chi_{\tilde G}$. Then $\|f_\varepsilon\|_p^p = 1 + O(\varepsilon^p) = 1 + o(\varepsilon)$ as $\varepsilon \to 0$.
On the other hand,

\[ \|Tf_\varepsilon\|_{d+1}^{d+1} \ge \|Tf\|_{d+1}^{d+1} + (d+1)\varepsilon \int (Tf)^d\, T\chi_{\tilde G}. \]

Now because $f, \chi_{\tilde G}$ are nonnegative functions and $T$ is defined by convolution with a
nonnegative measure, $\int (Tf)^d\, T\chi_{\tilde G} > 0$ if and only if $\int Tf \cdot T\chi_{\tilde G} = \langle T^*Tf, \chi_{\tilde G}\rangle > 0$.

$T^* \circ T$ is defined by convolution with a measure which has a continuous, strictly
positive Radon-Nikodym derivative with respect to Lebesgue measure, on the open
set $\{(s', |s'|^2) - (t', |t'|^2) : s' \ne t'\}$. The condition that $\tilde G \subset G$ thus ensures that
$T^*Tf > 0$ at every point of $\tilde G$. Therefore $\langle T^*Tf, \chi_{\tilde G}\rangle > 0$.

Consequently $\|Tf_\varepsilon\|_{d+1} \ge A + c\varepsilon$ for some $c > 0$ for all sufficiently small $\varepsilon > 0$,
and therefore $\|Tf_\varepsilon\|_q / \|f_\varepsilon\|_p \ge A + c'\varepsilon$ for small positive $\varepsilon$ for all $c' < c$, contradicting
the extremality of $f$. Thus any extremizer must be positive almost everywhere on
some open set.

This additional information can be fed back into the above argument, which then
demonstrates that $f > 0$ almost everywhere at every point $z + (s', |s'|^2) - (t', |t'|^2)$
where $z$ varies over a subset of full measure of some ball. One more iteration
establishes the conclusion. □

Corollary 9.2. Let $f$ be a nonnegative extremizer for (1.2). Then for any compact
set $K \subset \mathbb{R}^d$ there exists $c > 0$ such that $f(x) \ge c$ for almost every $x \in K$.

This follows from the Euler-Lagrange equation by reasoning already used in the
proof of Lemma 9.1, since $T^* \circ T$ is expressed by convolution with a nonnegative
measure $\mu$ with the property that any point is expressible as a finite sum of elements of
an open set on which $\mu$ has a continuous, strictly positive Radon-Nikodym derivative.

It remains to be proved that any extremizing sequence has a subsequence which
converges in $L^p$ norm, rather than merely in the weak sense proved above.

Lemma 9.3. Let $\{f_\nu\}$ be an extremizing sequence for inequality (1.2); thus $f_\nu \ge 0$,
$\|f_\nu\|_p = 1$, and $\|Tf_\nu\|_q \to A$. Then there exist a sequence of symmetries $\{\phi_\nu\} \subset \mathfrak{G}_d$,
an extremal $F$ for (1.2), and a subsequence $\{f_{\nu_k}\}$ such that $\phi_{\nu_k}^* f_{\nu_k} \to F$ in $L^{(d+1)/d}(\mathbb{R}^d)$
norm.

Proof. After passing to a subsequence, we may choose $\phi_\nu$ so that $\phi_\nu^* f_\nu = F_\nu + h_\nu$,
so that $\|h_\nu\|_p \to 0$, $\|F_\nu\|_p \to 1$, $F_\nu^p \rightharpoonup F^p$ for some nonnegative extremal $F$
satisfying $\|F\|_p = 1$, and $F_\nu, F$ satisfy the higher integrability and spatial decay bounds
provided by Lemma 8.1. Moreover, there exists $H \ge 0$ satisfying $\|H\|_p = 1$ and
$T^*H = AF^{p-1}$ almost everywhere.

As in the proof of (8.1) it follows from the a priori bounds and Rellich's lemma
that after passing to a subsequence, $\langle F_\nu, T^*H\rangle \to \langle F, T^*H\rangle = A\langle F, F^{p-1}\rangle = A$.
Therefore $\langle F_\nu, F^{p-1}\rangle \to 1 = \langle F, F^{p-1}\rangle$.

For any $\nu$ and any small $\delta > 0$ denote $E_{\delta,\nu} = \{x : F_\nu^p(x) < (1-\delta)F^p(x)\}$. Recall
that $q = d+1$ is the exponent dual to $p = (d+1)/d$. Then



Because $F > 0$ almost everywhere, this implies that for any compact subset $K \subset
\mathbb{R}^d$, $|E_{\delta,\nu} \cap K| \to 0$ as $\nu \to \infty$. Together with the higher integrability and spatial
decay bounds of Lemma 8.1 this implies that $\int_{E_{\delta,\nu}} F^p \to 0$ also. We may choose
$\delta = \delta(\nu)$ to tend to zero as $\nu \to \infty$, yet still have




Therefore 
Now

f _ pp) <§ [ F p <5. 

JFE(x)>(1-6)Fp(x) J 

Therefore by splitting the integral into the two regions $F_\nu^p < (1-\delta(\nu))F^p$ and
$(1-\delta(\nu))F^p \le F_\nu^p \le F^p$ we conclude that

\[ \int_{F_\nu < F} (F^p - F_\nu^p) \to 0 \quad\text{for each } \delta > 0. \tag{9.1} \]

Since $\int F_\nu^p = \int F^p$, (9.1) forces $\int |F^p - F_\nu^p| \to 0$. □

One final conclusion of Theorem 2.2 remains to be established, the existence of a
decomposition $g = g^\sharp + g^\flat$ where $\|g^\sharp\|_{C^1} \le R$ and $\|g^\flat\|_p \le \eta(R)$, where $\eta \to 0$ as $R \to
\infty$. Equivalently, if $\zeta \in C^\infty(\mathbb{R}^d)$ is compactly supported and satisfies $\zeta(0) = 1$, and
if $M_\rho$ is the Fourier multiplier operator with symbol $1 - \zeta(\xi/\rho)$ then $\|M_\rho g\|_p \le \tilde\eta(\rho)$
where $\tilde\eta(\rho) \to 0$ as $\rho \to \infty$. Here $g$ is a $(1-\delta)$-quasiextremal which satisfies (2.4)
and (2.5), and $\delta$ is permitted to depend on $R$ and/or $\rho$ and can be chosen to be as
small as may be desired.

Proof of (T2j|. Fix F satisfying \\F\\ p = 1 such that (g, T*F) > (1 - 5) A. Let e > 0. 
By the results proved above, if $\rho$ is sufficiently large then $T^*F$ may be decomposed
as $T^*F = H^\sharp + H^\flat$ where $\widehat{H^\sharp}(\xi) = 0$ for all $|\xi| > \rho/2$ and $\|H^\flat\|_q < \varepsilon$.

Split $g = g^\sharp + g^\flat$ where $\widehat{g^\sharp}$ is supported where $|\xi| \le 2\rho$, $\widehat{g^\flat}$ is supported where
$|\xi| \ge \rho$, and $\|g^\sharp\|_p + \|g^\flat\|_p \le C < \infty$; this may be done with $C$ a constant depending
only on $d, p$. To do this, set $g^\flat = M_\rho g$ where $M_\rho$ has symbol $1 - \zeta(\xi/\rho)$, with the
auxiliary function $\zeta$ chosen to satisfy $\zeta(\xi) = 1$ whenever $|\xi| \le 2$. Then $g^\flat$ will
continue to satisfy (2.4) and (2.5), uniformly in $\rho, g$ so long as $\rho \ge 1$.

We aim to prove that if $\delta, \varepsilon$ are sufficiently small and $\rho$ is sufficiently large, then
$\|g^\flat\|_p$ is less than any preassigned quantity.

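Consider $\tilde g = g^\sharp - g^\flat$. Then

\[ \begin{aligned}
\langle \tilde g, T^*F\rangle &= \langle g, T^*F\rangle - 2\langle g^\flat, T^*F\rangle \\
&= \langle g, T^*F\rangle - 2\langle g^\flat, H^\flat\rangle \\
&\ge (1-\delta)A - 2\|g^\flat\|_p\|H^\flat\|_q \\
&\ge (1-\delta)A - 2C\varepsilon.
\end{aligned} \]

Thus

\[ \|\tilde g\|_p \ge (1-\delta) - 2C\varepsilon A^{-1}. \]

The same reasoning gives

\[ \|g^\sharp\|_p \ge (1-\delta) - C\varepsilon A^{-1}. \tag{9.2} \]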

Since $1 < p < 2$, there exists $c_p$ with $0 < c_p < \infty$ such that for any $s, t \in \mathbb{R}$,
\[ \tfrac12|s+t|^p + \tfrac12|s-t|^p \ge |s|^p + c_p\min\big(|s|^{p-2}|t|^2,\ |t|^p\big). \]
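
The uniform-convexity inequality above can be sanity-checked numerically. The sketch below is not a proof and makes two assumptions that are not in the text: by homogeneity and the symmetry $t \mapsto -t$ it only tests $s = 1$, $t > 0$, and it samples $t$ on a finite grid. The function names are illustrative.

```python
def convexity_gain_ratio(t, p):
    """For s = 1: (|1+t|^p / 2 + |1-t|^p / 2 - 1) / min(t^2, |t|^p)."""
    gain = 0.5 * abs(1 + t) ** p + 0.5 * abs(1 - t) ** p - 1.0
    return gain / min(t * t, abs(t) ** p)


def smallest_ratio(p, lo=0.01, hi=100.0, steps=20000):
    """Smallest sampled ratio; a positive value is consistent with c_p > 0."""
    ts = [lo + (hi - lo) * k / steps for k in range(steps + 1)]
    return min(convexity_gain_ratio(t, p) for t in ts)
```

For $p = 3/2$ the sampled ratio stays bounded away from zero, which is what the existence of a positive $c_p$ requires; the regime $|t| \le |s|$ is governed by the quadratic term and the regime $|t| \ge |s|$ by $|t|^p$.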

Therefore

\[ \tfrac12|g(x)|^p + \tfrac12|\tilde g(x)|^p \ge |g^\sharp(x)|^p + c_p\min\big(|g^\sharp(x)|^{p-2}|g^\flat(x)|^2,\ |g^\flat(x)|^p\big) \]

for all points $x \in \mathbb{R}^d$. Therefore

l- (e + 6)> J \gt\* + c p J min (b»rVlM#)- 
Let e > be another small quantity. Then 

1 -0(e + 5) > \\g% + ce 2 - p [ \g\x)\ p dx 

and consequently by ( 19. 2p . 

e 2 ' p [ \g\x)\ p dx = 0{e + 5) 

J\g\x)\>e\gt{x)\ 

and hence 

y\\ p p < e p \W\\ p P + Ce p - 2 (e + 6) < Ce p + Ce p - 2 (e + 6). 

By choosing first e,5 to be small, then e = (e + S) 1 ^ 2 ^^, we arrive at the desired 
bound. □ 

It is natural to ask whether extremizers of (1.2) satisfying $\|f\|_p = 1$
are unique modulo the action of $\mathfrak{G}_d$. We can offer only the following weak substitute.

Corollary 9.4. Let $\{F_\nu\}$ be any sequence of extremizers of (1.2) which satisfy
$\|F_\nu\|_p = 1$. Then there exist a subsequence $\{F_{\nu_k}\}$ and a sequence of symmetries
$\phi_k \in \mathfrak{G}_d$ such that the sequence $\{\phi_k^* F_{\nu_k}\}$ is convergent in $L^p(\mathbb{R}^d)$ norm.

This follows directly from Lemma 9.3.



10. On the symmetry groups 

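To list all elements of $\mathfrak{G}_{d,d}$, we employ coordinates $(x,y) = (x', x_d; y', y_d)$ with
$x', y' \in \mathbb{R}^{d-1}$ and $x_d, y_d \in \mathbb{R}^1$. $\mathfrak{G}_{d,d}$ is a subset of the set of all elements $\Phi$ of $\mathrm{Diff}(\mathbb{R}^d) \times
\mathrm{Diff}(\mathbb{R}^d)$ of the form

\[ (x', x_d; y', y_d) \mapsto \big(Lx' + u,\ tx_d + a + x'\cdot v + Q(x');\ \tilde Ly' + \tilde u,\ ty_d + \tilde a + y'\cdot\tilde v + \tilde Q(y')\big), \tag{10.1} \]

where $L, \tilde L$ are linear endomorphisms of $\mathbb{R}^{d-1}$, $u, \tilde u, v, \tilde v \in \mathbb{R}^{d-1}$, $0 \ne t \in \mathbb{R}$, $a, \tilde a \in \mathbb{R}$, and
$Q, \tilde Q : \mathbb{R}^{d-1} \to \mathbb{R}$ are homogeneous quadratic polynomials. The proof of the next
lemma is a straightforward verification, left to the reader.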

Lemma 10.1. A mapping $\Phi$ of the form (10.1) belongs to $\mathfrak{G}_{d,d}$ if and only if all of
the following equations hold:

—a — a + 2u ■ u — \u\ 2 — \u\ 2 = 

L*L = tl for some t G R \ {0} 

v + 2L*(u-u) = 

v + 2L*(u-u) = 

Q(x') = \Lx'\ 2 -t\x'\ 2 

Q(y') = \Ly'\ 2 -t\y'\ 2 . 
Given any $L, u, t, a, v$ there exist $Q, \tilde Q, \tilde L, \tilde u, \tilde a, \tilde v$ satisfying all these equations, and
these quantities are uniquely determined by $L, u, t, a, v$.

Definition 10.2. A collection of $d$ points $x_j \in \mathbb{R}^d$ is said to lie in general position
if, writing $x_j = (x_{j,1}, \dots, x_{j,d})$, the $d \times d$ matrix

\[ \begin{pmatrix}
x_{1,1} & x_{1,2} & \cdots & x_{1,d-1} & 1 \\
x_{2,1} & x_{2,2} & \cdots & x_{2,d-1} & 1 \\
\vdots & \vdots & & \vdots & \vdots \\
x_{d,1} & x_{d,2} & \cdots & x_{d,d-1} & 1
\end{pmatrix} \]

is nonsingular.

This definition is coordinate-dependent. The $d$-th coordinates of the points $x_j$ do
not enter into consideration.

Lemma 10.3. The action of $\mathfrak{G}_d$ on $\mathbb{R}^d$ is generically $d$-fold transitive. That is, for
any two sets $\{x_j : 1 \le j \le d\}$ and $\{y_j : 1 \le j \le d\}$ of $d$ points in general position in
$\mathbb{R}^d$, there exists $\phi \in \mathfrak{G}_d$ satisfying

\[ \phi(x_j) = y_j \quad\text{for all } 1 \le j \le d. \]

Proof. There exists a unique affine endomorphism $x' \mapsto Lx' + u$ of $\mathbb{R}^{d-1}$ which maps
$x'_j$ to $y'_j$ for all $1 \le j \le d$. Fix any $0 \ne t \in \mathbb{R}$. Define $Q(x') = |L(x')|^2 - t|x'|^2$ for
$x' \in \mathbb{R}^{d-1}$. Let $a \in \mathbb{R}$ and $v \in \mathbb{R}^{d-1}$ be parameters to be specified. Consider the
diffeomorphism $\phi \in \mathfrak{G}_d$ defined by $\phi(x', x_d) = (Lx' + u,\ tx_d + a + v\cdot x' + Q(x'))$.
We merely need to choose these parameters to satisfy the equations

\[ tx_{j,d} + a + v\cdot x'_j + Q(x'_j) = y_{j,d} \quad\text{for all } 1 \le j \le d, \]

which is to say,

\[ v\cdot x'_j + a = y_{j,d} - |L(x'_j)|^2 - t\big(x_{j,d} - |x'_j|^2\big). \]

Since $t$ has already been specified, this is a system of $d$ inhomogeneous linear equations
for $(v, a) \in \mathbb{R}^{(d-1)+1}$. The assumption that $\{x_j : 1 \le j \le d\}$ is in general position
means precisely that this system has full rank $d$. Thus for any $t$, there exists a
(unique) solution. □

The proof also demonstrates that $\phi$ is uniquely determined, up to the "vertical"
scaling factor $t$.
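
The construction in the proof of Lemma 10.3 amounts to two linear solves against the same general-position matrix of Definition 10.2. The following is a minimal sketch in exact rational arithmetic; the helper names `solve` and `interpolating_symmetry` are hypothetical, and the code only verifies the interpolation property $\phi(x_j) = y_j$, not membership of $\phi$ in the symmetry group.

```python
from fractions import Fraction


def solve(A, b):
    """Solve the square system A z = b exactly by Gauss-Jordan elimination."""
    n = len(A)
    M = [[Fraction(v) for v in row] + [Fraction(b[i])] for i, row in enumerate(A)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if M[r][col] != 0), None)
        if pivot is None:
            raise ValueError("singular system: points are not in general position")
        M[col], M[pivot] = M[pivot], M[col]
        pivot_value = M[col][col]
        M[col] = [v / pivot_value for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                factor = M[r][col]
                M[r] = [vr - factor * vc for vr, vc in zip(M[r], M[col])]
    return [M[i][n] for i in range(n)]


def interpolating_symmetry(xs, ys, t):
    """Return phi(x', x_d) = (L x' + u, t x_d + a + v.x' + Q(x')) with phi(x_j) = y_j."""
    d = len(xs)
    n = d - 1
    # Matrix with rows (x'_j, 1), as in Definition 10.2.
    P = [list(x[:n]) + [1] for x in xs]
    # Affine map x' -> L x' + u: one linear solve per output coordinate.
    rows = [solve(P, [y[i] for y in ys]) for i in range(n)]
    L = [r[:n] for r in rows]
    u = [r[n] for r in rows]

    def apply_L(xp):
        return [sum(L[i][k] * xp[k] for k in range(n)) for i in range(n)]

    def Q(xp):
        return sum(c * c for c in apply_L(xp)) - t * sum(c * c for c in xp)

    # Linear system for (v, a): v.x'_j + a = y_{j,d} - t x_{j,d} - Q(x'_j).
    rhs = [ys[j][n] - t * xs[j][n] - Q(xs[j][:n]) for j in range(d)]
    sol = solve(P, rhs)
    v, a = sol[:n], sol[n]

    def phi(x):
        xp = x[:n]
        top = [c + u[i] for i, c in enumerate(apply_L(xp))]
        last = t * x[n] + a + sum(v[k] * xp[k] for k in range(n)) + Q(xp)
        return top + [last]

    return phi
```

Changing `t` changes the map but not the interpolation property, which mirrors the remark that $\phi$ is determined only up to the vertical scaling factor.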

11. Affine surface measure 

Our definition of $T$ uses the measure $dt$, rather than the surface measure on the
paraboloid induced from its inclusion into $\mathbb{R}^d$. This measure $dt$ is entirely natural
from a geometric viewpoint, which we now explain.

First, we recall the definition and properties of affine arclength measure. This is
a measure on a subinterval $I \subset \mathbb{R}$, associated to any sufficiently smooth mapping
$\gamma : I \to \mathbb{R}^d$, as follows: Define

\[ L_\gamma(t) = \big|\det\big(\gamma^{(1)}(t), \gamma^{(2)}(t), \dots, \gamma^{(d)}(t)\big)\big| \]

where $\gamma^{(k)}(t) = d^k\gamma(t)/dt^k$. Then the measure $\sigma_\gamma(I)$ is defined to be

\[ \sigma_\gamma(I) = \int_I |L_\gamma(t)|^{2/d(d+1)}\,dt. \tag{11.1} \]

See [8] for some discussion.

Affine arclength measure $\sigma_\gamma$ enjoys two natural invariances, which provide its raison
d'être:

(i) For any $A \in Gl(\mathbb{R},d)$, $\sigma_{A\circ\gamma}(I) = |\det(A)|^{2/d(d+1)}\sigma_\gamma(I)$.

(ii) For any injective $C^d$ mapping $\phi : I \to \mathbb{R}$, $\sigma_{\gamma\circ\phi}(I) = \sigma_\gamma(\phi(I))$.
Both identities are easily verified.

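There is a natural analogue in the codimension one case, which we call affine
surface measure. Let $F$ be any $C^2$ mapping from an open subset of $\mathbb{R}^{d-1}$ to $\mathbb{R}^d$.
Write $F(t) = (F_1(t), \dots, F_d(t))$. For each pair of indices $i,j \in \{1, 2, \dots, d-1\}$, form

\[ F_{ij}(t) = \det\begin{pmatrix}
\frac{\partial F_1(t)}{\partial t_1} & \cdots & \frac{\partial F_1(t)}{\partial t_{d-1}} & \frac{\partial^2 F_1(t)}{\partial t_i\,\partial t_j} \\
\frac{\partial F_2(t)}{\partial t_1} & \cdots & \frac{\partial F_2(t)}{\partial t_{d-1}} & \frac{\partial^2 F_2(t)}{\partial t_i\,\partial t_j} \\
\vdots & & \vdots & \vdots \\
\frac{\partial F_d(t)}{\partial t_1} & \cdots & \frac{\partial F_d(t)}{\partial t_{d-1}} & \frac{\partial^2 F_d(t)}{\partial t_i\,\partial t_j}
\end{pmatrix}. \]

From these quantities $F_{ij}$ we form a $(d-1) \times (d-1)$ matrix, and consider its
determinant

\[ \mathcal{L}_F(t) = \det\big(F_{i,j}(t)\big)_{i,j=1}^{d-1}. \]

For any Borel set $U \subset \mathbb{R}^{d-1}$ contained in the domain of $F$, define the affine surface
measure of $U$ to be

\[ \sigma_F(U) = \int_U |\mathcal{L}_F(t)|^{1/(d+1)}\,dt. \]

Affine surface measure enjoys corresponding invariances.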
Affine surface measure enjoys corresponding invariances. 



Proposition 11.1. Let $A \in Gl(\mathbb{R},d)$ and let $\phi : V \to U$ be a $C^2$ diffeomorphism of
an open set $V \subset \mathbb{R}^{d-1}$ with a subset $U$ of the domain of $F$. Then

\[ \sigma_{A\circ F}(U) = |\det(A)|^{(d-1)/(d+1)}\,\sigma_F(U), \tag{11.2} \]

\[ \sigma_{F\circ\phi}(V) = \sigma_F(\phi(V)). \tag{11.3} \]

In the case where $F(t) = (t_1, \dots, t_{d-1}, |t|^2)$, affine surface measure is a constant
multiple of Lebesgue measure on $\mathbb{R}^{d-1}$, which is the measure used in our definition of
$T$. Thus our use of this measure, rather than of surface measure on the paraboloid,
has a natural geometric context.
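
For the paraboloid the determinants $F_{ij}$ can be evaluated directly from the definition. The sketch below does this in exact integer arithmetic at a given parameter point; the function names are illustrative, and the value $2^{d-1}$ that it returns is a consequence of the definition above rather than a figure quoted from the text.

```python
def det(M):
    """Determinant by cofactor expansion along the first row (small matrices only)."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for c, entry in enumerate(M[0]):
        if entry != 0:
            minor = [row[:c] + row[c + 1 :] for row in M[1:]]
            total += (-1) ** c * entry * det(minor)
    return total


def paraboloid_density(t):
    """Return det(F_ij(t)) for F(t) = (t_1, ..., t_{d-1}, |t|^2), where d = len(t) + 1."""
    n = len(t)  # number of parameters, n = d - 1
    d = n + 1

    def F_ij(i, j):
        M = []
        for k in range(d):
            if k < n:
                # Component F_k(t) = t_k: gradient e_k, second derivatives vanish.
                first = [1 if c == k else 0 for c in range(n)]
                second = 0
            else:
                # Component F_d(t) = |t|^2: gradient 2t, Hessian 2 * identity.
                first = [2 * t[c] for c in range(n)]
                second = 2 if i == j else 0
            M.append(first + [second])
        return det(M)

    return det([[F_ij(i, j) for j in range(n)] for i in range(n)])
```

The result does not depend on the point `t`, so the density $|\mathcal{L}_F(t)|^{1/(d+1)}$ is constant, in agreement with the statement that affine surface measure on the paraboloid is a constant multiple of Lebesgue measure in the parameter.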

Proof of Proposition 11.1. Property (11.2) follows directly from the definition;
passing from $F$ to $A \circ F$ multiplies each $F_{ij}$ by $\det(A)$. To derive (11.3), note first that
if $G = F \circ \phi$, then for any indices $i,j$, the determinant $G_{ij}(t)$ depends only on the
Jacobian matrix $D\phi(t)$ and on the first and second partial derivatives of $F$ at $\phi(t)$.
Indeed, the chain rule produces an undesired contribution involving first derivatives
of $F$ and quadratic expressions involving $D\phi$ in the rightmost column, but this
contribution is a linear combination of the first $d-1$ columns and hence can be eliminated
via row operations.

Thus matters reduce to the case where $\phi$ is linear, and it is no loss of generality to
suppose that $t = 0$ and $F(0) = 0$. We claim then that

\[ |\mathcal{L}_{F\circ\phi}(0)| = |\det(\phi)|^{d+1}\,|\mathcal{L}_F(0)|. \tag{11.4} \]

Taking into account both the formula $d\tau = |\det(D\phi)|\,dt$, where $\tau = \phi(t)$, and the
exponent $1/(d+1)$ in the definition of $\sigma_{F\circ\phi}$, would then yield (11.3).

To derive (11.4), the invariance (11.2) already established can be used to reduce
matters to the case where $F$ takes the form $F(t) = (t_1, t_2, \dots, t_{d-1}, f(t))$ where
$Df(0) = 0$. Then

\[ (F\circ\phi)_{i,j}(0) = \det(\phi) \cdot \sum_{k,l} \partial^2_{k,l} f(0)\,\phi_{k,i}\,\phi_{l,j} \]

where $\partial^2_{k,l} f$ denotes the second partial derivatives of $f$, and $\phi = (\phi_{k,l})_{k,l=1}^{d-1}$. Thus

\[ \mathcal{L}_{F\circ\phi}(0) = \det(\phi)^{d-1} \cdot \det\Big(\sum_{k,l} \partial^2_{k,l} f(0)\,\phi_{k,i}\,\phi_{l,j}\Big)_{i,j}. \]

This last matrix is simply the composition $\phi^* \circ \partial^2 f(0) \circ \phi$ where $\phi^*$ denotes the
transpose of $\phi$. Therefore

\[ \mathcal{L}_{F\circ\phi}(0) = \det(\phi)^{d-1} \cdot \det(\phi)^2 \det(\partial^2 f(0)) = \det(\phi)^{d+1}\,\mathcal{L}_F(0), \]

which is (11.4). □

Inequalities for operators defined by convolution with (the push-forward onto a
curve of) affine arclength are studied in [7] and [11]. See the former paper for
references to other related works. One may ask whether there are analogues, for
hypersurfaces equipped with affine surface measure, of the results of [7], [11] for curves
equipped with affine arclength measure.